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Abstract 

We study several fundamental properties of a class of stochastic processes called spatial A- 
coalescents. In these models, a number of particles perform independent random walks on some 
underlying graph G. In addition, particles on the same vertex merge randomly according to a 
given coalescing mechanism. A remarkable property of mean-field coalcscent processes is that 
they may come down from infinity, meaning that, starting with an infinite number of particles, 
only a finite number remains after any positive amount of time, almost surely. We show here 
however that, in the spatial setting, on any infinite and bounded-degree graph, the total number 
of particles will always remain infinite at all times, almost surely. Moreover, if G = Z"^, and the 
coalescing mechanism is Kingman's coaleseent, then starting with N particles at the origin, the 
total number of particles remaining is of order (log* N)'^ at any fixed positive time (where log* 
is the inverse tower function) . At sufficiently large times the total number of particles is of order 
(log* N)'^~'^, when d > 2. We provide parallel results in the recurrent case d = 2. The spatial 
Beta-coalescents behave similarly, where log log TV is replacing log* N. 
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1 Introduction 



1.1 Motivation and main results 

The theory of stochastic coalescent processes has expanded considerably in the last decade, as a 
consequence of their deep connections to population genetics, spin glass models and polymers. In 
theoretical population genetics, coalescents arise as natural models of merging of ancestral lineages 
(see, for example, [17^ [9]). A particular A-coalescent, usually called the Bolthausen-Sznitman coa- 
lescent, is thought to be an important object for describing the conjectured universal ultrametric 
structure of numerous mean-field spin glass models including the Sherrington-Kirkpatrick model 
(see \12\ [T3l I37j). The same coalescent has also been recently linked in [15] to scaling limits of 
directed polymers. 

The A-coalescents are stochastic processes taking values in V, the space of partitions of N = 
{1,2,...}. In the current context, each class (or block) in 11 € can and will be thought of 
as a particle. For any coalescent process (nj,t > 0) it is true that Ht+s is a coarsening of IIj, 
for any t,s > 0. There is a natural semi-group structure ("P,*), where 11*11' is the result of 
merging the blocks of 11 "according to" the partition 11'. The A-coalescents can be canonically 
characterized as the Levy processes in ('P,*), with the property that no two coagulation events 
occur simultaneously. The above Levy property corresponds to the fact that Ht+s = *II'g, for 
all t,s >0, where (n^,t > 0) is an independent, identically distributed process (see, e.g. [6l §3.1.3] 
for details). A direct construction of A-coalescents from [33] is now considered standard in the 
probability literature. As will be discussed in more detail below, each A-coalescent corresponds 
uniquely to a finite measures A on [0,1]: for instance, the case where A = (5|o} gives the well- 
known Kingman coalescent from mathematical population genetics, while the uniform measure on 
[0, 1] gives the Bolthausen-Sznitman coalescent. These two processes, as well as the more general 
Beta-coalescents are especially interesting and amenable for analysis, since they are self-similar in 
a certain sense made precise by the results of [4] . 

The present work is devoted to the study of several fundamental properties of a more general 
class of models, introduced in (31] and called the spatial A-coalescents. In this setting, particles (i.e., 
partition classes) are positioned on some underlying locally finite graph G = {V, E). The dynamics 
of the process is enriched in the presence of the geographical structure in two ways. First, the 
particles move as independent continuous time simple random walks on G. Second, stochastically 
independent A-coalescence takes place on each site of G. More precisely, at any given time, only 
particles that are on a same site can coagulate. Moreover, at every site the coalescence mechanism is 
that of the original (mean- field) A-coalescent. The spatial Kingman coalescent is a natural model of 
an interacting particle system where particles perform independent random walks on an underlying 
graph, and any pair of particles coalesce at rate 1, as long as they are located at the same site. 

The spatial A-coalescent processes are particularly well suited to model merging of ancestral 
lineages for a population that is evolving in a geographical space G, where the spatial motion of 
individuals is taken into account. In this way, geographical factors such as isolation and overpopula- 
tion can influence the dynamics, making it a more realistic model for long-term population behavior. 
In the above interpretation, the vertices of the graph are referred to as demes and represent a dis- 
cretization of physical space. Each edge represents potential migratory routes between two adjacent 
demes. 

While the mean-field A-coalescent processes are relatively well understood at this point, even 
basic properties of their spatial counterparts are much more delicate to analyze. Intuitively, the 
difficulty comes from the fact that the two ingredients in the dynamics, the coalescence and the 
migration, affect the particles in the system in opposite directions: the spatial motion makes par- 
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tides diffuse away from one another, and the coalescence keeps them together. Indeed, our results 
show that the competition between these two forces can be very tight. Our main results provide 
information about the limiting behavior of spatial A-coalescents as the initial number of particles 
n tends to infinity, at both small and large time-scales. We consider the case where initially all 
the particles are located at the origin o of G. For some of our results, the only assumptions on G 
are that it is connected and has bounded degree A = max„gy degree(f ) < oo. However, several 
of our more precise results on the asymptotic behavior are restricted to the setting where G is the 
d-dimensional lattice Z'^. 

Define the function log* n as the inverse log* n := inf{m > 1 : Tow(m, 1) > n} of the tower 
function, where Tow(0, x) = x and 

Tow(n,x) = e^°™("-i'^) = £^ . (1) 

n iterations 

Theorem 1.1. Fix e > 0, and consider the spatial Kingman coalescent on a graph G with hounded 
degrees. Start with n particles located at a € G, and let A^"(t) be the total number of particles at 
time t > 0. There are constants C, c > depending only on t and the degree bound such that 

pfAf"(t) >cVolB(o,(l-e)log*n)') > 1, and 

pfAr"(t) < C7Vol5(o,(l + e)log*n)') ^ 1. 

A more concise way of stating Theorem II. II is that N^(t) x Vol B{o, (1 + o(l)) log* n) with high 
probability (see the paragraph on "Other notations" at the end of Section 11.31 for x and other 
notations related to asymptotic behavior). 

Remark. The function log* n tends to infinity with n, but at a very slow rate: log* n < 4 for 
n < lo^^^^^^*^. Thus while this function diverges to infinity from the mathematically rigorous point 
of view, for all practical sample sizes it takes value 3 or 4. 

Remark. The behavior in Theorem 11.11 contrasts that of the mean-field case, where N'^{t) converges 
(without renormalization) to a finite random variable N{t) for all t > 0, due to well-known properties 
of Kingman coalescent. In the lattice case G = Z"^, we see that N"'{t) diverges as (log* n)*^, i.e. 
extremely slowly. Even on a regular tree, where balls have maximal volume given the degree, N^(t) 
diverges only as 6*^^°^ 

The mean-field A-coalescent processes can be classified according to the coming down from 
infinity (GDI) property. For the partition- valued process (IIj, i > 0), this means that the initial 
configuration IIq = {{i} : i G N} is countably infinite, but that IIj contains only finitely many 
classes at any time t > 0, almost surely. The Kingman and the Beta-coalescents with parameter in 
a certain range (see below) come down from infinity, while the Bolthausen-Sznitman coalescent does 
not. If the spatial coalescent is viewed as an interacting particle system, then GDI is the property 
that the total number of particles in the system at any fixed positive time remains bounded (tight), 
as the initial number of particles tends to oo. It turns out that for the mean-field model (as well as 
the spatial model with G a finite graph), either the total number of particles in the system converges 
almost surely to a finite random variable, or it diverges at any given time. 

A natural question, and one of the motivations of this work, is whether a similar dichotomy 
occurs in spatial coalescents. It is known that if the mean-field A-coalescent comes down from 
infinity, then the number of particles in its spatial counterpart will be locally finite (see Section II. 3p . 
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It is natural to ask whether the total number of particles can nevertheless be infinite on infinite 
graphs. We answer this question for general spatial A-coalescents. It is remarkable that the answer 
is universal, in that it does not depend on the driving measure A nor on the geometry of the 
underlying infinite graph G. 

Theorem 1.2. For any measure A on (0,1) and any infinite graph G, consider the spatial A- 
coalescent on G started with n particles at a £ G. If N^{t) denotes the total number of particles at 
time t, then N^''{t) — )• oo almost surely, as n ^ oo. 

In particular, for A such that the mean-field coalescent comes down from infinity, the number 
of particles will be locally finite, but globally infinite. For this reason we call this phenomenon the 
global divergence of the spatial A-coalescent. 

Our next result on the fixed time asymptotics of A^" concerns a setting that is particularly 
relevant for some biological applications, where the coalescence mechanism is given by the Beta- 
coalescent with parameter a G (1,2) (as defined in the next section). 

Theorem 1.3. Fix a G (1)2), and consider the spatial Beta(2 — a, a) coalescent on a graph G 
with bounded degree. Start with n particles located at a £ G, and let N"'{t) be the total number of 
particles at time t. There are constants C,c> depending only on t,a and the degree bound such 
that 

pfA^"(t) > c Vol 5 (o,c log log n)) > 1, and 

pfiV"(t) < C Vol 5(o,C log log n)) > 1. 

Similarly to Theorem ll.H this may be stated more concisely as Nn{t) = ©(Vol B{o, ©(log log n))). 
When the graph has growth Yol B{o, R) x R'^, this translates to N"{t) x (log log n)"'. 

The above theorems describe the state of the system at a fixed time t. We also provide estimates 
for the number of particles that survive for a long time. Here the diffusion of particles plays a more 
important role, hence the results depend in a more fundamental way on the underlying graph. We 
focus on Euclidean lattices G = 1^^, d>2. 

Theorem 1.4. Assume that the coalescence mechanism is Kingman's coalescent. Let G = IJ^ , let 
m = log* n, and fix 5 > 0. Then there exist some constants c > and C > (depending only on 
d,6) such that, if d > 2, 

P fcm'^~2 ^ jv"(W) < Cm'^^'^) > 1, 

while, if d = 2, then 

P (clogm < N'^iSm^) < Clogm) > 1. 

n— >oo 

If the coalescence mechanism is a Beta- coalescent with parameter a G (1, 2), then the same statement 
holds with m = log log n. 

One interpretation of this theorem is that, when the underlying graph G is Z'^, the resulting 
random particle system may also be thought of as a microscopic description of the small-time 
evolution of a solution to the parabolic nonlinear partial differential equation: 

dtu=^Au-fiu^, (/3>0), (2) 
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starting from a singular initial condition, such as a Dirac delta measure at a given spatial location. 
We refer the reader to |2H [22] for a discussion of this equation. 

As suggested by Theorem II. 4[ for the study of the long-term particle system behavior it is 
natural to rescale the particle system's time by a factor of m?, while rescaling space by a factor 
of m. Theorem 11.41 indicated that the rescaled system should exhibit a Boltzmann-Grad limiting 
behavior, i.e., the number of interactions (intersections and coalescences) between a particle and 
all others over any finite time interval is tight and does not tend to as n — )• cxd. The behavior 
of the rescaled system mirrors the system of Brownian coagulating particles studied in [21] and 
[22], in which the PDE ([2]) is derived as the governing macroscopic behavior, and is obtained 
as a particular case of the Smoluchowski system of PDEs. It is worth pointing out that in the 
current case, the discrete structure of the lattice remains important in determining the frequency 
of coalescence events even after space and time have been rescaled, which would alter the formula 
fixing the reaction coefficient in the limiting PDE. 

Remark. The log* function featured in Theorem 11.11 might remind the reader of a result of Kesten 
|23[ I24j . who studied the number of allelic types in a Wright-Fisher model with small mutation 
probability. 

In Kesten's model, allelic types take values in Z, and the type of an offspring is identical to that 
of its parent, except on a mutation event of a small probability (inversely proportional to the total 
population size). When a mutation occurs, the offspring's type is chosen by adding an independent 
Z-valued random variable (with some given, bounded, distribution) to the parent's type, that is, by 
making a random walk step from the parent's type. It turns out that the number of types (and, 
in fact, their relative positions in space) has an equilibrium distribution. It is shown in \23\ [23] 
that the number of observed types at equilibrium is of order log* n, where n is the sample size. 
The above Fisher- Wright model may seem closely related to the one-dimensional spatial Kingman 
coalescent, but on a closer look one realizes that the dynamics of the two models are quite different, 
and there is no direct relation between the results. 

Kesten's result may be phrased as follows. Let T„ be the tree generated by Kingman's (non- 
spatial) coalescent started with n particles, and consider a branching random walk indexed by r„. 
Then the number of distinct values at the leaves is of order log* n. A variation of the strategy used 
in Section [3] applies in this setting, and can lead to an alternate proof of Kesten's result. 

1.2 Heuristics and proof ideas 

It is evident from Theorems ll.ll and ll.3l that the long term behavior of the number of particles in the 
spatial coalescent depends delicately on the precise nature of the coalescent. We now describe the 
approximate behavior of the spatial coalescent started with a large number of particles, all located 
at o. The proofs are mostly a detailed treatment of the following heuristic observations. 

To understand the finite initial condition, we turn to the infinite one. Consider a given A- 
coalescent which comes down from infinity (see below). Let Nt be the number of particles in the 
(non-spatial) coalescent started with A'^o = oo. For Kingman's coalescent it is the case that Nt ~ 2/t, 
whereas for Beta-coalescents with parameters {2 — a, a) with 1 < a < 2, we have Nt ~ Cat^~°' [31 [5]. 
The rough description that follows applies to both of these, as well as more general coalescents. In 
general, one would expect Nt to be concentrated (for small t) around some function g{t) (such a 
function is found in [3]). The coalescent started with N particles is similar to the infinite coalescent 
observed from time g~^{N) onward. 

Consider now the non-spatial coalescent with emigration, where each particle also disappears at 
some rate p. In fact, the parameter p may depend on the size of the population, as long as np{n) is 
non-decreasing. It turns out that for coalescents that come down from infinity, the emigration does 
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not influence Nf so much, and is still close to g{t). The total number of particles that emigrate 
when starting with N particles is then close to a Poisson variable with mean 

f{N) := [ g{t)p{g{t))dt. (3) 

(The upper bound of integration is some arbitrary constant.) 

Now comes the key observation: if N is large, the number of particles migrating back into o 
is negligible (under a technical condition that holds for most spatial coalescents), and in fact, an 
overwhelming proportion of those particles that emigrate will have emigrated by time g~^{f{N)). 
Thus we find that at this time, the number of particles at o and each of its neighbors is of order f{N). 
A second observation is that the resulting populations can be approximated by independent spatial 
coalescents, when observed from time g'"^(/(A^)) onward. In particular, at time g~^{f o f{N)) there 
are of the order of / o f{N) particles at each vertex in B{o,2). This "cascading onto neighbors" 
continues until step m, where m is such that / o • • • o f{N) {m repeated iterations of /) is of order 
1. Note that in these m steps a ball of radius m has been roughly filled. 

Applying this heuristics to the case of Kingman's coalescent and the Beta-coalescents with 
parameters (2 — a, a) and 1 < a < 2, gives the following. For Kingman's coalescent and constant 
p, we have f{n) ~ 2plogn, and for Beta-coalescents we have /(n) ~ Ccipn^~°', for some constant 
Ca > 0. Thus in the first case, m = log* N. In the second case, we find m ~ c log log A^. In general, 
this gives m ~ f*{n), where 

f*{n) =inf |m> 1 : /o- - -o/ (ra) < l|. (4) 

m iterations 

Note that if p{n) decreases fast enough so that f{n) is bounded, then it follows from this heuristic 
analysis that the spatial coalescent will come down from infinity globally. However, when p is 
constant, it can be proved that / is always unbounded, which in turn implies the result about 
global divergence of any spatial A-coalescent. 

Turning to the long time asymptotics, by the above reasoning we may start from a configuration 
consisting of a tight number of particles at each site of the ball of radius m around the origin. 
Since the number of particles per site is tight, the coalescent dynamics influences the evolution 
less than the diffusion. In particular, the structure of the underlying graph becomes important for 
the asymptotic behavior of the process. For simplicity, let us restrict ourselves to d-dimensional 
Euclidean lattices with (i > 3. Let p{t) denote the average number of particles per site in the 
ball of radius m at time t. Then at time = 1 we have pito) x 1 and limt_!.oo p{t) = 0. Each 
particle present in the configuration at time t coalesces with another particle at an average rate 
approximately p(t), so that ^N{t) = —N{t)p{t)/2. Dividing by the volume of the ball, one arrives 
to the ODE 

|mO = -^K<)^ (5) 

whose solution is given by p[t) = 2/[t + c) for some c > 0. 

The approximation ([5]) should be valid as long as the diffusion of particles away from the initial 
region (i.e., B(o,m)) is negligible. The influence of diffusion should start to be visible at times of 
order m^. In particular, at time m^, the density p{iTn?) is of order m"^, so the total number of 
remaining particles is of order m'^"'^. Assuming the plausible claim that the remaining particles 
are approximately uniformly distributed over a ball of radius order m, a simple calculation (using 
hitting probabilities for random walks) now implies that each of them has a positive probability of 
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never meeting any other particle again, and so the number of particles that survive indefinitely is 
of order m*^"^. 

We wish to point out that van den Berg and Kesten [Tj [8] have shown a density decay similar 
to ([5]) for a related model of coalescing random walks. However their results differ in two ways. On 
the one hand, the coalescence mechanism which they analyze is different. On the other hand, and 
more importantly, their initial condition is initially homogeneous in space, and not restricted to a 
large ball. This restriction is the cause of much of the difficulty in the current setting - see Section 
[7] for more details. 

1.3 Definitions and background on spatial coalescents 

Kingman's coalescent. Suppose that we are given an integer n > 1. Kingman's n-coalescent is 
the Markov process (n",t > 0), with values in the set Vn of partitions of [n] := {1, . . . such 
that IIq = {{!}, {2}, . . . , {n}}, and such that each pair of blocks merges at rate 1, and these are 
the only transitions of the process. Blocks of the partition 11" may be viewed as indistinguishable 
particles, and we often refer to the number of blocks of IT" as the number of particles alive at time 
t. A simple but essential property of Kingman's n-coalescent is the so-called sampling consistency 
property: the restriction of (n""*'^,t > 0) to [n] has the same distribution as an n-coalescent. This 
enables one to construct a Markov process {Ilt,t > 0) with state space V, the set of partitions of 
N, such that the law of 11 when restricted to [n] equals the law of 11". In particular, the initial 
state of this process is the trivial partition Ho = {{!}, {2}, . . .}. The process 11 is called Kingman's 
coalescent. For background reading, see for instance [17 ^ 134 ^ [6]. 

A-coalescents. Let A be a finite measure on [0,1]. A coalescent with multiple collisions, or A- 
coalescent, is a Markov process (11^ ,t > 0) with values in the set of partitions of N characterized 
by the following properties. If n E N, then the restriction of {Ilt,t > 0) to [n] is a Markov chain 
(n|"\ t > 0), where IIq = {{1},{2},... ,{n}}, and where the only possible transitions are mergers 
of blocks (it is possible to merge several blocks simultaneously into one block, but no two mergers of 
this kind can occur simultaneously) so that whenever the current configuration consists of b blocks, 
any given fc-tuple of blocks merges at rate 

Xb,k= [ x'^-^l-xf-'^Aidx). (6) 
J[o,i] 

Note that O'' is interpreted as 1, so that an atom of A at causes each pair of particles to coalesce 
at a finite positive rate A({0}). In this way any A-coalescent can be thought of as a superposition of 
a "pure" coalescent with multiple collisions driven by measure A((ix)l(0, and a time-changed 
Kingman's coalescent. An atom of A at 1 causes all the particles to coalesce at some positive fixed 
rate. Such A-coalescent may be viewed as a killed A'-coalescent where A'{dx) = A((ix)l[0, l)(x). 
Kingman's coalescent is a particular A-coalescent, obtained when the measure A equals Sq, the unit 
Dirac mass at 0. Any A-coalescent 11 is sampling consistent, that is, if m < n then the restriction of 
n" to [m] is equal in law to 11™ . It is this observation that allows one to construct an infinite version 
of the process. It is interesting to note the following fact shown by Pitman |33j : A-coalescents are 
the only exchangeable Markov coalescent processes without simultaneous collisions. We refer the 
reader to [33j for definitions and further properties. 

As already mentioned, if A{dx) = dxl^^ij, the corresponding A-coalescent is usually called the 
Bolthausen-Sznitman coalescent, and more generally if A is the Beta(2 — a, a) distribution where 
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Q G (0, 2) is a fixed parameter, that is, 



A{dx) 



1 



(1 - x)""^ dx, 



(7) 



r(2 - a)r(a) 



X 



the corresponding A-coalescents is called B eta- coales cents with parameter a. The Bolthausen- 
Sznitman coalescent is the special case a = 1, and it does not come down from infinity. For 
a E (1,2), the corresponding Beta-coalescents come down from infinity, and they are important 
processes from the theoretical evolutionary biology perspective, due to the following result from 
|35j : the Beta-coalescent with parameter a G (1,2) arises in the scaling limit of population models 
where the offspring distribution of a typical individual is in the domain of attraction of a stable law 
with index a. Apart from the Kingman coalescent, the Beta-coalescents with parameter a G (1,2) 
are the most-studied class of A-coalescents (see, e.g., [HI Oil]). 

Spatial coalescents. As informally described above, spatial coalescents are processes which com- 
bine spatial motion of individual particles with coalescence of particles located on the same site of 
a given graph of bounded degree. Let A be a given finite measure on [0, 1]. A spatial A-coalescent, 
as defined in |31| . is a Markov processes (n|,t > 0) with values in the space 

partitions of {1,2,...} indexed by spatial locations. That is, an element x = (7r,£) G consists 
of a partition vr = {Ai, A2, ■ ■ .}, and a sequence i = (^1,^2, • • •)' where ii specifies the location of 
the block A^. There are only two types of transitions possible for IIj = {Ilt,£t)- (i) provided there 
are b blocks at a location v £ V, then any given /c-tuple of them will merge at rate Xb^k given by 
([6]), independently over v; and (ii) independently of the coalescent mechanism, each block A^ of vr 
migrates at rate 9. This means that if the block is at v, then some vertex w is chosen according to 
the distribution p{v, •), where p{v,w) is a given Markov kernel. When this happens, ik is changed 
from V to w. To simplify the discussion, we will assume unless otherwise specified, that p{x, y) is 
the transition kernel for the simple random walk on the underlying graph G. 

If vr is a partition let i j mean that the particles labeled i and j belong to the same block of 
vr. For (vr,^) G and v £ V, denote by ^i,{7r,£) the number of blocks in vr with label (location) v. 

Spatial A-coalescents inherit the sampling consistency directly from A-coalescents. Namely, 
if we consider a spatial coalescent started from n + 1 particles (that is, blocks) and consider its 
restriction to the first n particles, the new process has the law of a spatial coalescent started from 
n particles. This simple property will be used on several occasions. In particular, it implies that 
if (7r^,£^) and (vr^,^^) are such that ^y{TT^,i^) < ^^(vr^,^^), for all v, then there exists a coupling 
of two spatial coalescents ((^t^ £|), (n|, £f )), t > 0) such that {UUi) = {7r\t), i = 1,2 and 
i^v(n.l,£l) < #„(n^,^^) for all V, almost surely. The same property guarantees the existence of 
spatial coalescents started with infinitely many particles on an infinite graph (see Theorem 1 in [31] 
for a particular construction). 

Spatial A-coalescents may be started from configurations containing countably infinitely many 
particles at each site of G, see [31| . However, our main results concern spatial A-coalescents started 
from the following initial condition: 



where o is some given reference vertex called the origin of G. In words, all the infinitely many 
particles are initially located at the origin o. 
From now on we abbreviate 



n^ = ({{i},{2},...},(o,o,... 



)) 



(8) 



X,{t) = #,{Ut,it) 



and 



x:{t) = #,in2,it). 



(9) 
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We denote the total number of blocks by N*{t) = Y.v&v ^v{t) (resp. iV"(t) = Ylv^v^vi^))- When 
not in risk of confusion, we wih drop the superscript n to simphfy notations. It is clear from 
the definitions that both ({X^(t)}„gG, t > 0) and {N*{t),t > 0) have Markovian transitions, with 
respect to the filtration generated by the coalescent process 11. They carry only partial information 
about the evolution of the corresponding spatial coalescent, in particular, they do not determine 
the evolving partition structure. 

In the language of theoretical population biology, a sample of n individuals is selected from 
the population at the present time, and their ancestral lineages are followed in reversed time. The 
above transition rules (i)-(ii) given above reflect the idea that individuals typically reproduce within 
their own colony (so that only particles on the same site may coalesce), and occasionally there is 
a rare migration event, which corresponds to the random walk transitions. In the case where the 
coalescence mechanism is simply Kingman's coalescent, we note that this model may be viewed as 
the ancestral partition process associated with Kimura's stepping-stone model [25\ I26j. 

Coming down from infinity. Let (Il^jt > 0) be Kingman's coalescent. As already mentioned, 
Kingman |27(l28j realized that while 11 starts with an infinite number of blocks at t = 0, its number 
of blocks becomes finite for all t > 0, almost surely. A coalescent with multiple collisions may 
or may have the same property, depending on the measure A. More precisely, there are only two 
possibilities as shown in [33]: let E (resp. F) denote the event that for all t > there are infinitely 
(resp. finitely) many blocks. Then, if A({1}) = 0, either P{E) = 1 or P{F) = 1. When P{F) = 1, 
the process 11 is said to come down from infinity. For instance, a Beta-coalescent comes down from 
infinity if and only if 1 < a < 2, henceforth we make this an assumption whenever working with 
Beta-coalescents. 

In the context of spatial coalescents, assuming that A({1}) = 0, Proposition 11 in j31j implies 
that when the initial number of particles is infinite, then Xy{t) becomes finite for all v £ V and 
t > with probability 1, if and only if the underlying measure A is such that the mean-field (i.e., 
non-spatial) A-coalescent comes down from infinity. In this situation, we may say that the spatial 
coalescent comes down from infinity locally. Naturally, this stays true if the initial condition is ([8]). 

Other notations. Unless specified otherwise, c, C (and variations ci, C2, . . .) will henceforth de- 
note positive constants that depend only on the underlying graph, and that may change from line 
to line. Typically, c, ci, . . . denote sufficiently small, whereas C, Ci, . . . denote sufficiently large con- 
stants. We also use the symbols a„ ~ bn and o„ x 6„ to denote respectively that an/bn — )• 1, and 
ttn/bn is bounded away from and 00, as n —t- 00. 

Organization of the paper. The rest of the paper is organized as follows. Section [2] starts 
with some preliminary remarks and observations concerning large deviation estimates for King- 
man's coalescent and Ewens's sampling formula, as well as several couplings between the spatial 
A-coalescents and the corresponding (mean-field) A-coalescents, which will be used throughout the 
paper. Section [3] contains a proof of Theorem II. II on the behavior of the spatial Kingman coalescent 
in finite time. As many of the subsequent results in the paper build on this, we recommend reading 
this section prior to any of the following sections. Section |4] contains the proof of Theorem 11.31 on 
the finite-time behavior of the spatial Beta-coalescents. Section [5] returns to the general case of 
spatial A-coalescents and arbitrary graphs with bounded degree, and contains the proof of global 
divergence (Theorem ll.2p . In final Sections [6] and [7] we study respectively the lower bound and the 
upper bound for long term behavior of Kingman's coalescent (as stated in Theorem 1 1.4p . The lower 
bound obtained in Section 16.21 is true for general A-coalescents, but we provide in Section 16.31 an 
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alternate shorter proof for the special case of Kingman's coalescent, that also gives tighter bounds. 
The proof of the upper bound in Section [7] turns out to be the most technical part of the paper, 
and it is based on a delicate multi-scale analysis. 

Sections HHG] may be read in any order, depending on the interest of the reader. We recommend 
reading Section [6] prior to Section [71 



2 Preliminary lemmas 

2.1 Some large deviation estimates 

We begin with an easy Chernoff type bound for a sum of exponential random variables, which 
we prefer to state in an abstract form now so as to refer to it on several occasions later. In our 
applications, ES" will typically be small. 

Lemma 2.1. Let {£^j}jg/ he independent exponential random variables with E,Ei = Let S = 
X^jg/ Ei- Then for any < e < 1 



'{S < (1 -e)E5) < exp 



Additionally, for Q < e < ^^^^ 



¥,S sup|/ii} ' 

F{S > (1 + e)ES) < exp 



' 4VarS 



' AYavS 



Remark. If I = {n, n + 1, . . . } and jx-i ^ ci " for some a > 1, then as n — t- do, Eg^^p^^.i is bounded 
away from 0, hence the second bound holds for all e > small enough, for all n. 

Proof. Using Markov's inequality, for any < A < ^ infj^^"^} 

P(5 > (1 + e)E5) < e-^(i+^)^^Ee^^ 

- „-A(l+£)ES TT 1 

-lil-A^, 

< e-^(i+=)^^ exp A/ii + AV?) 

_ -AeES'+A^ Var 5 

where we have used that for x G (0, 1/2) we have — ln(l — x) < a; + x"^ . Taking A = f^^fj) which is 
allowed since e < f^stnpf/i } ^ yields the upper bound. 

The lower bound follows from a similar argument with A = — ^ g . □ 

We now apply this to get a large deviation estimate for Kingman's coalescent. This uses a simple 
idea which can already be found in Aldous [1], who used it to prove a central limit theorem for 
the number of particles at time t. Denote by P" the law of the (non-spatial) Kingman coalescent 
started with n blocks. Let N{t) be the number of blocks at time t. 

Lemma 2.2. Let t = t{n) in such a way that t{n)^^ = o(n). For any < e < 1/2, for n large 
enough, 



l-.<^<l + .J>l-exp^ t 
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Proof. For the upper bound, let m = [(1 + e)2/i]. The time it takes the process to get from n 

to m particles is a sum of independent exponential random variables with means (2) for k = 
m + 1, . . . ,n. Call this sum S. If N{t) > m then S > t. We have 



E 2 ~2m-i~t/(l + e) 
i — ™ 1 1 \ / 



k=m+l 

provided m = o{n). Similarly, 



^^'^= E 2 ~(4/3)m 
(3 + o(l))e2 



-3 



fc=m+l 

Thus, for e < 2/3 + o(l), 

nS >t)< exp , 2^ 
by Lemma l2.ll The lower bound is similar using the upper bound on 5. □ 

We now consider Kingman's coalescent with spatial migration. Let P" be the law of a simpli- 
fied process where n particles initially located at a single site coalesce according to Kingman's 
dynamics, while each particle (or block of particles) migrates at rate p, and any block that migrates 
away from o is ignored from that time onwards. Denote by Z„ the total number of blocks that ever 
migrate away from o. 

One can think of each migration event as of a "unique mutation on the genealogical tree" , by 
giving it for example the label equal to its occurrence time. Since migrations happen at rate p for 
each block present in the configuration at site o, one quickly realizes that Zn is a realization from 
a well-known distribution arising in mathematical population genetics. Namely, set 9 = 2p, and 
suppose that on the (non-spatial) Kingman coalescent tree mutation marks occur at a Poisson rate 
of 6/2 per unit length. Using the language of mathematical population genetics, assume the infinite 
alleles models (all mutations create a different allele, and so different individuals in the original 
sample of n are in the same family if and only if they descend from the same mutation and there 
has been no other mutation between this common ancestor and the present individuals). The marks 
of the mutation process generate a random partition Ilg on the leaves of the tree by declaring that i 
and j are in the same block of Hg if and only if there is no mutation mark on the shortest path that 
connects i and j. In Figure [1] different blocks of this partition are represented by different colors. 
Then it is easy to see that Zn has the law of the number of blocks in Ilg. It is well-known (see, e.g., 
(3.24) in Pitman [3l]) that Z„ is of order 9logn for large n. The following large deviation estimate 
is part of the folklore, but we could not find a precise reference for it in the literature. 

Lemma 2.3. Fix e > 0. There are c, C > such that 

Zn 



log n 



> e] < Cn' 



Furthermore, for any U, 



P"(Z„ >U) < Cn^e-^. 

The proof is based on the Chinese restaurant process representation of Ewens's sampling formula. 
Let Kn,i be the number of blocks of size i in Ug, where i = 1, . . . ,n. Then the distribution of 
(Kn^i, . . . , Kn,n) IS given by Ewens's sampling formula ESF{9): 

n' -A- 9°'' 

^'^"-'-■"■'=^—">= .(a+i)...'(.+„-i n?^- ("» 
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Figure 1: The random partition generated by mutations (squares). Here Z5 = 4. 



for any given collection ai, . . . , a„ of non-negative integers such that ^11=1 ^'^i — ^■ 

The Chinese restaurant process representation of (fTOj) (see [Ml §3.1]), states that the number of 
blocks in Hg satisfies 



n 

d 



Zn = Y.^i (11) 

i=l 

where Ci are independent Bernoulli random variables with mean 

P{Ci = 1) = 

1 + 6 

Proof of Lemma 12.31 By (fTTI) we have for A > 



i<n 



, i<n 



< exp (-0(1 - e-^)(C + log?i)) . 



By Markov's inequality 



F{Zn < {l-e)9logn) < exp (^A(l - e)6l log n - 6'(1 - e"^)(C + log ?i) 

< exp ((-Ae0 + O(A2)) log n + C) 
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For small positive A the coefficient of log n is strictly negative. 
Similarly, for A > 



Ee- 



,xz, 



n — 




\j=i 



< exp 0(e^ - l)(C7 + logn) 



Thus, by Markov's inequality, 



¥{Zn >U) < exp 6l(e 



A 




(12) 



Taking U = {1 + e)9logn and A = A(e, 9) small enough gives the first upper bound. Taking A = 1 



A similar computation can be found in Greven et al. jl9|. Lemma 3.3]. 
2.2 Coupling and comparison 

The final general tool we use is a coupling of the spatial process with simpler coalescent processes 
such as non-spatial ones. While it is usual in coalescent theory to keep track of the entire partition 
structure as time evolves, here we are only interested in the number of particles remaining in the 
system at any given time, so accounting for the full partition data is cumbersome. 

However, the detailed Poisson process construction (e.g., |31j Theorem 1) becomes useful in 
the context of coupling. Its advantage comes from the following fact: all the information on both 
the merging and the migration is given by the Poisson clocks (ringing for jumps and for mergers), 
hence one builds the spatial coalescent process path (keeping track of the particle labels, and of 
the partition structure) by applying a deterministic function ui-hy-uj to this data. Therefore, it 
is straightforward to append another deterministic ingredient (as the "coloring procedure" in the 
following lemma) to the construction. 

Henceforth, it is convenient to consider the following simpler variation, where the partition 
structure is ignored. Label the n initial particles by l,...,n, and let xi,...,Xn be their initial 
locations. Let S*^, . . . , S""" be n i.i.d. simple random walks on G in continuous time with jump rate 
p started at xi, . . . ,Xn, respectively. To each A;-tuple of labels ii < ... < ik, and each b > k, 
corresponds an independent Poisson process M^^^^ with intensity Xb^k- The particles labeled 
ii < ■ ■ ■ < ik coalesce at a jump time t of M^, ■ if and only if they are all located at the same site 
V at time t~ , and there are a total of b particles at v. In this case, the newly created particle inherits 
the minimal label ii. (Subsequently there are no particles with labels i2, ■ ■ ■ ,ik-) In particular, its 
trajectory starting from time t will be (5*^, s > t). 

Suppose that the n initial particles are partitioned into classes according to a partition vr = 
(Si, . . . , Bj.) of {1, ... , n}, where r > 1. We wish to compare the system X = {Xy{t),t > 0}vev to 
the one which consists only of the particles that belong to a particular class B of vr. More precisely, 
for each B it, denote by the spatial coalescent process whose initial configuration contains 
only the particles from B. Denote by N{t) the total number of particles of X{t) and by N^{t) the 
total number of particles of X^ (t) . 



gives the second claim. 



□ 
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Lemma 2.4. There is a coupling {X,X^^, . . . such that, almost surely, 

r 
i=l 

for each block B of n, and hence 

r 

N^{t) < N{t) < ^iV-^'(t). 

i=l 

Note that in the couphng given below, the processes X^"- are not independent. In fact, under 
weak assumptions on the coalescent and when all the blocks are "small" , there is a coupling including 
independence, see Lemma 17121 

Proof. Fix a realization of the process {X{t),t > 0) as described above. Note that at any time t, 
each particle in the current configuration can be identified with a set of particles from the original 
configuration, that have been merging (possibly in several steps) to form this particle (this is the 
partition- valued realization of the spatial coalescent). In the rest of the argument, we say that a 
particle intersects B £ tt ii its corresponding set intersects B. 

For each i = 1, . . . , r, and t > 0, let X^"- (t) be the configuration obtained from X{t) by restricting 
to only those particles that intersect -Bj. The consistency property of A-coalescents implies that for 
each i the law of X^"- is that of the spatial coalescent started from the initial configuration restricted 
to elements of Thus this is a coupling of the processes. 

In this construction we have X^' (t) < Xy{t), since X^ may contain particles that do not intersect 
Bi. Moreover, any particle contributing to Xy{t) intersects at least one Bi, giving the bound 
Xv{t) < T.i^v"{t)- The inequalities relating N{t) and N^'{t) are an immediate consequence. □ 

A second type of coupling we will need is between a spatial A-coalescent and its mean-field (i.e., 
non-spatial) counterpart. Fix a vertex u & V oi the graph, and consider a spatial A-coalescent 
{X^{t),t > 0}t,gy started with a finite number of particles and such that initially Xu{Q) = n. Let 
M{t) denote the number of particles on u at time t that have always stayed at u, and let Z{t) 
denote the number of particles that jumped out of u prior to time t. In parallel, let {N(t),t > 0) 
denote the number of particles at time i in a mean- field A-coalescent started with n particles. 

Lemma 2.5. There exists a coupling of X and N such that: 

M{t) < N{t) < M{t) + Z{t), a.s. for allt>0, (13) 

and 

N{t) - Z{t) < Xu{t) < N{t) + Z{t), a.s. for all t > 0. (14) 

Proof. The process M{t) may be realized as a mean-field coalescent where, in addition, particles 
are killed at rate p. In that case, if we let Z{t) denote the total number of particles that have 
been killed, we see immediately that on the one hand, M{t) < N(t), and on the other hand, 
N{t) = Mit) + Z{t) where Z{t) < Z{t). Indeed, M{t) + Z{t) counts the number of particles 
if we freeze particle instead of killing them. However, in N[t) these particles keep coalescing, 
and so the difference Z{t) = N{t) - M{t) < Z{t). This proves For ([H]), note first that 

Xu{t) > M{t) = N{t) - Z{t) > N{t) - Z{t). Finally, the last inequality in ^ is obtained by 
observing that Xu{t) is made of particles that never jumped out of u (there are M{t) such particles) 
and of particles that have jumped out of u and have come back at some time later, potentially 
coalescing in the meantime. There can never be more than Z[t) such particles, since this is the 
total number of particles that jump out of u. □ 
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In fact, one can be slightly more precise than the above estimate. We shall need the following 
observation. Define two processes 



S{t) = Z{t) - [ pM{s)ds V{t) = S{tf - [ pM{s)ds. (15) 

It is a standard (and easy) fact that both are continuous time martingales under the law P", 
with respect to the filtration T generated by the above coupling process. In fact, if we define 
Q = a{N{u),u > 0} to be the cr-algebra generated by A'', and let = a{Q,Ft}-, then the processes 
S{-) and V{-) are continuous-time martingales with respect to the filtration . 

Lemma 2.6. For each time interval [a, 6], we have the stochastic domination 

rb 



F{Z{b) - Z{a) >x\G)<¥ ^Poisson (^p j 



N{s)ds > X 



Proof. Given G, Z is a pure jumps process with jumps of size 1 that arrive at rate pM{t) < pN{t) 
at time t, almost surely. □ 

Finally, a global comparison with mean-field coalescents can be obtained in the case of the 
spatial Kingman coalescent as follows (see also |20l §6.1]). Let S be an arbitrary subset of vertices 
and consider the restriction of X to S. 

Lemma 2.7. Fix a time t < 2, and vertex set S, and assume that all particles are in S at time 0. 
Let Z = Z{t) he the number of distinct particles that exit S by time t, and let Ns{t) be the number 
of particles in S at time t. Then, for some c = c(e) > 0, 

iV,(r)>Z+ii±£M^<e-|S|/.. 



Note that the bound is independent of the starting configuration. This lemma is a precursor to 
Lemma 17.51 

Proof. Let Qt be the number of particles in S that have survived until time t but have not left S. 
We have then that Ng{t) < Z + Qt. The rate of coalescence inside S at time t is 



E("'f)^iK*f') 



(by Jensen's inequality for (2).) If Qt < '^\S\ for some t < t then we are done (since r < 2.) 
Otherwise, \S\ ■ (*^*2'^') > 2|^C2*)' ^^"^ Qt stochastically dominated by the block counting 
process of a Kingman coalescent slowed down by a factor of 2|5|. Lemma 12.21 completes the proof. 

□ 



3 Finite time behaviour of the spatial Kingman coalescent 
3.1 An induction 

The first step of the argument is to show that for some m (close to log* n) there are no particles 
outside B{o, m) at some specified time, and to provide lower and upper bounds (both polynomial in 
the volume of the ball) on the number of particles at each site inside the ball at the same time, on 
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an event of high probabihty. This can be done for any n, but it is easier to consider initially a sub- 
sequence of n's, and then interpolate to get the result for all n. With this in mind, for a given integer 
m denote Vm = Vol i?(o, m), and define n = n{m) := Tow(m, V^). Note that log* n = m + log* 
is very close to m, as m — t- oo. 

Define the sequence of times tk = (Tow(m — k, V^))~^ , where k = 0, . . . ,m. This sequence is 
increasing from to = to tm = Kn^- Moreover, t^+i 3> tk (in particular t^+i > 2tfc). Recall 
that 6 = 2p, where p is the jump rate of particles, and that A is the maximal degree in the graph. 
Define the events Bj. by 



Bk = = for all v ^ B{o, fc)} f] {x^{tk) 



,4t, 



for allv£B 



{o,k)], (16) 



We are particularly interested in the event Bm which states that at time tm each site of B{o,m) 
has between cV^ and CV^ particles, with no remaining particles outside Vm- 

Lemma 3.1. With the above notations, ¥{Bm) — 1 as m ^ oo. 

The idea is to prove a bound on ]P(-B^) by induction on k. For k = we have P"(i?o) > 1 — 2n^^, 
since the probability of a pair of particles coalescing by time to = is at most (2)^0; and the 
probability of a particle jumping by that time is at most to^^- The key to the induction step is the 
following 



Lemma 3.2. Fix constants oo, ai,e > 0. Consider the coalescent started with n particles, all located 
at u £ G: Xy{^) = n6u{v). Let r = a(logn)~^ for some a G [ao,ai], and define the event 

A = a, {X^ir) G [(1 - e)g„, (1 + e)Q,]} , 



where 



Qv 



2/t V = u, 

{d/du)\ogn \v -u\ =1, 
\v -u\> I. 



Then there exists a C depending on e, oo,oi,du only such that 

C 



log n 



Proof. In this argument, the expression with high probability (w.h.p.) stands for "with probability 
greater or equal to 1 — jjp^^"- Let Z{t) be the number of distinct labels corresponding to particles 
that exit u during [0, t] ^where each label is counted at most once). Let N{t) denote the total 
number of particles in the coupling with the mean-field coalescent of Lemma l2.5i Thus we have: 

N{t) - Z{t) < Xu{t) < N{t) + Z{t), almost surely. (17) 

Therefore one needs to estimate N{t) and Z{t). For any fixed e, by Lemma 12.21 we have 

¥''{\tN{t)/2 - 1| > e) < Ce-"/^ < Cn-\ (18) 

So the event {-jj^ G (1 — e, 1 -|- e)} happens with high probability. By Lemma 12.31 we have 



Z{oo) 
log n 



> e] < Cn~ 



(19) 
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In the rest of the argument consider the the process on the event B := {-277^ £ (1—^; !+£)}□{ € 
{9 — e,6-\-£)} that occurs with high probability. Note that on B, Z{t) < Z{oo) < (l + e)logn <^ 2/t 
and (117p imply the required bounds for Xu{t). 

Moreover, on {Z{t) < (1 +e)logn} D B, the probability that at least one particle jumps more 
than once before time r is bounded by pr(l + e)logn = C/log^?i. On the event that no particle 
jumps more than once, there cannot be any particle located at a distance strictly greater than 1 
from u at time r. 

Similarly, on {Z{t) < (1 + e)logn} D B, the probability of at least one coalescence event 
involving particles located at site v ^ u before time r is at most r((^+f°sn)^ 

again bounded by 

C/logn. We conclude that w.h.p. there is no coalescence outside of u before time r. 

This implies that w.h.p. the particles located at a neighbor t; of u at time r are precisely those 
that made a (single) jump from u to v. To show that their number is close to {6 /du) logn, it suffices 
to show that Z{t) is concentrated around ^logn (which is already known for Z{oo)). Namely, 
since (on the event of high probability) each jump is to made from u to a random neighbor of u, 
and there are no further moves or coalescence events involving the particles outside of u, X„(r) is 
concentrated near Z{T)/du for any u ~ n, due to a law of large numbers argument. Indeed, the 
number of particles jumping from u to any particular of its neighbors has variance of order logn, 
and using a normal approximation to binomial random variables, the probability of deviating by 
elogn from the mean is no more than Cn~^. 

Thus it remains to show that Z(oo) — Z{t) < elogn with high probability. To this end, note 
that Z{oo) — Z{t) is the number of particles that exit u after time r. Denote by J> the cr-field 
generated by the evolution of the process up to time r. By Lemma 12.31 monotonicity and the 
Markov property at time r, 

P"(Z(oo) - Z{t) > elogn|J^,)l|^(,)<3/,| < P^/- (Z(oo) > elogn) 

< C(3/T)^e-"'°s" 

<Cl/logn, □ 
and since {N{t) < 3/r} D B occurs w.h.p., this concludes the argument. 

Proof ofLemmalM We have FiB^) < F{B'^ D Bm-i) + P(S^_i) < ^(^S) + Efe<™ IP(^fcl^fe-i)- 
Noting that P(5g) < 1/n, we turn to estimating ¥{Bl\Bk-i). 

Given J't^_i, consider now the coupling of Lemma 12.41 applied to the process observed on 
[tk-i,tk], where the partition vr is J~'tk-i measurable and where two labels belong to the same 
equivalence class of tt if and only if their corresponding particles have the same position at time 

— 1/3 

tk-i- On the event B^-i we have that log Xu{tk-i) — logtfc_i tf, , u & B{o,k — 1), hence 
Lemma 13.21 applies to each of the corresponding processes. We conclude that with probability at 
least 1 — logx'^tfc 1) — 1 ~ ^^l!^ following occurs: during [tk-i,tk\ (i) No particle from u jumps 
more than once; (ii) At most 3/tfc particles remain at u; (iii) Each neighbor of u receives between 
■^t^ and ^tj, particles. Say that a vertex u € B{o, k—1) is bad at stage k on the complement 
of the above event. 

Applying the right hand inequality of Lemma [2. 4 ^ on the event B^-i H {there are no bad vertices 
at stage k}, we have that Xu(tk) = outside B{o,k) (since no particle jumps twice and at time 

— 1/3 

tfc-1 all the particles are inside B{o, k — 1). Moreover, each site v G B^ has at least -^t^ particles 
jumping to it from some neighbor u of v, hence Xy{tk) > -^t^^ . Finally, for each v G B{o, k) we 
have Xy{t}S) < Sfj^^ + Spt^^ < Afj^^ (here we may assume that t^J^ < l/(3p)), since it receives at 
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most ^tf, from each of its (at most A) neighbors. Hence -Bfc-i H {there is no bad vertex at stage 
k} C Bk. 

Therefore F{B'^\Bk-i) < P(3 a bad vertex at stage k) < CVk-itl^^ < Tow(m-ly^) - follows 
that 

^ m)^n^^^Tow{m-k,V;i)-\r' 
since the term for k = m overwhelmingly dominates all the others. □ 



3.2 Lower bound estimates 

Lemma 13.11 gives us a fairly accurate description of the spatial coalescent up to positive times of 
order o(l). Additional estimates are needed for understanding the behavior up to a constant time 
t. We begin with the lower bound, since it is simpler. Henceforth, we let t > be a fixed time. 
Recall that the initial configuration of the spatial coalescent consists of 7i = Tow(?7t,, Vj^) particles 
located at o. 

Lemma 3.3. Fix t > 0. The collection {Xt{v), v S B{o,m)) can be coupled with the family 
{Cv, V G B{o,m)) of i.i.d. Bernoulli variables with mean e"^*, so that 

FNv G B{o,m),Xy{t) > Cv) > 1. 

Proof. Assume that m is sufficiently large so that tm < t. By Lemma l3.lt with probability tending 
to 1, each site in B{o, m) is not empty at time tm- On this event, fix one particle at each v £ B{o, m), 
and color it red. Consider the evolution with coloring (see the proof of Lemma 12.41 for a similar 
construction), so that if a red particle coalesces with another particle, the newly formed particle 
retains the red color. Now, it is obvious that between time tm and t, each red particle has probability 
Q-p{t-tm) y ^-pt Qf j^Q^ migrating, independently of all other red particles, so the claim holds. □ 



3.3 Upper bound estimates 

After time tm, the bounds in the definition of Bm (cf. (|16|) ) still hold for most vertices, but will 
begin to fail for some vertices. As the number of particles per vertex decreases, the probability of 
failure increases. We overcome this by combining the second part of Lemma 12.31 with Lemma |2. 71 

Lemma 3.4. Fix e,t > 0, and start with n = Tow(m, V^) particles at o. With high probability 
there is no particle outside B{o, (1 + e)m) at or before time t, and the total number of particles at 
time t is at most CV{ij^ir)m- 

Proof. By Lemma l3.ll with high probability at time tm there are no particles outside Bm, and the 
number of particles inside B{o,m) is at most 4:Vm/tm = By ignoring coalescence transition 

after time tm, so that each particle performs a simple random walk independently of all the others, 
the number of particles located at any particular site at any later time can only become larger. 
Each particle makes an additional Poisson(/9(i — tm)) steps during [tmji]) so the probability that 
at least one of these particles makes at least em steps is bounded by 4V^C^"*/[emJ !. This last 
quantity tends to 0, since Vm < CA"*. 

Thus with an overwhelming probability, there are no particles outside B{o, (1 + e)m) at time t. 
By Lemma 12.71 and the above observation, the number of particles within B{o, (1 + e)m) is at most 
a constant multiple of V(i+£-),;„, again with an overwhelming probability. □ 
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3.4 Interpolation 

Proof of Theorem \l.l[ If n = n(m) = Tow(m, V^^) for some m, then Lemmas 13.31 and 13.41 imply 
that with high probabihty the number of particles at time t is between cVm and CV(i_|_£)m- Since 
log* n = m + log* ~ m, this implies the claim. 

For intermediate n, we use the monotonicity of the process in n. Note that log* n{m + 1) — 
log* < 2 (since Vm+i < e^™), so that the sequence n(m) is sufficiently dense to imply the 

theorem. □ 

Remark. Since m = log* n{m) — log* m + 0(1), the proof above gives the lower bound cl^„iog* ^. 
As for the upper bound, the proof of Lemma 13.41 works with radius m + Cm/ log m in general, and 
m + log m for graphs with polynomial growth. 

It is possible to get both lower and upper bounds that are closer to Vol -B(o, log* n). For the 
lower bound, one way would be to argue that most vertices continue to behave typically (as in 
Lemma l3.2p even up to constant times. 

The upper bound is more delicate. One way of improving it is by considering the evolution of 
the total number of particles in B{o,k) for k > m, similarly to the argument of Section [6l Under 
additional growth assumptions on the graph, both bounds are of order Vol i?(o, log* n). 

4 Results for spatial Beta-coalescents 

We now turn to the proof of Theorem ll.31 In fact, we prove a slightly more general result. Suppose 
that A has a sufficiently regular density near 0: A{dx) = g{x)dx, where for some B > and 
a <G (1, 2) we have 

g{x) ~ Sx^"", x-^0. (20) 

This includes the case where A is the Beta(2 — a, a) distribution. A consequence of (j20p is the 
following standard estimate for the rate of coalescence events when there are n particles remaining: 

Lemma 4.1. The sequence (A„)n>2 is increasing in n. Furthermore, there exists c > which 
depends only on a,B, such that if A satisfies (j20p . then A„ ~ cn". 



Proof. The monotonicity of A„ in n is a consequence of the natural consistency of A-coalescents. 
The second part of the statement is a consequence of (f20l) and Tauberian theorems. See, e.g., [TOl 
Lemma 4] for more details. □ 

4.1 Lower bound in Theorem Theorem 11.31 

Define the following parameters 

/3 = — - — T = an~^ for some a G [ao,ai] 7 = min{l — a/2, /3/2, 1/8}, (21) 

and observe that both 7 > and a — 2 + 7 < —7. We next consider the quantity 

Yn = r Nis) ds. 



Jo 

Lemma 4.2. Assume that A satisfies (I20p . Then for some c,C depending only on A, 

HYn > n^-^+T) < C7n~^, Vn > 2, (22) 

and 

^(Yn < cn^) < Cn-^, Vn > 2. (23) 
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Remark. It follows from Theorem 5 in [3] that ~ cn^ almost surely as n — )• oo, for some 
c > 0. However this result does not provide any estimate on the deviation probability. 

Proof. The key fact is that if the process N{t) attains some value k, then it stays at k for an 
exponentially distributed time with mean l/A^. Since the probability of hitting k is at most 1, 

Ey„ < y A < cn^- 

fe<n 

by Lemma |4.1[ The upper bound ()22p follows by Markov's inequality. 

The lower bound is more delicate. We argue that with high probability the first M = n°'~^~^'^ 
jumps all occur before time r and that throughout these jumps N(t) remains above n/2. Summing 
over only these jumps will give the lower bound (123p . 

Let Bm be the number of particles lost in the next coalescence when there are m particles 
present. It is known [U Lemma 7.1] that there exists C > such that 

P(B„ > k) < Ck-"^ for ah m,k>l. (24) 

In particular, Ei?^ < c for some constant depending only on A. Thus the total size of the first M 
jumps has expectation at most cM. Let tk be the time of the fcth jump in N{t), then by Markov's 
inequality 

P'^(7V(tM) < n/2) < < cn"-^+^ < cn-''. (25) 

n — n/2 

On the event that N(tM) ^ n/2, the rate of each of the first M jumps is at least Xn/2- Thus, 
by Markov's inequality, and by monotonicity of Am, 

F{tM > r, N{tM) > n/2) < < cn'^+^f+f^ < cn-^. (26) 

r 

Thus, combining ([26]) with ([25]), P(yl^) < cn^^ , where A = {tM < T,N{tM) > n/2}. 
Note that, on the event A, 

Yn= N{t)dt > / N{t)dt > {n/2)tM- 



JO 



It thus suffices to show that P"(tj,f < cn' ^) < Cn . However, the rate of each jump is at most 
An, and therefore 



M 



1=1 



where Ei are i.i.d. exponentials with rate A„. Now, from Lemma l4.ll we know that 



i<M 

and by Lemma |2. II with e = 1/2, 



P < cn"/-^ /2j < exp (-3^™"^^^^) < Cn'"/ 

as needed. This completes the proof of Lemma l4.2i □ 
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The next result gives a lower bound on the number of particles that exit the origin. This 
complements the upper bound of Lemma |2. 61 Recall that Z{t) is the number of particles that exit 
the origin by time t. The idea is that as long as Z is small, the true behavior is close to the upper 
bound. 

Lemma 4.3. Let A be the event {Z{t) < n^}. Then F{A) = 0{n~'^). 

Proof. We introduce the random time Ta defined for any < a < 1 by = inf{t > : Z{t) > 
aN{t)]. Define 

Ai = {Z{t ^Ta)<n^} A2 = Ar\{T>Ta}. 

Note that ^ C U ^2 so it suffices to prove that P{Ai) = 0{n~^), for i = 1, 2. 

Consider Ai first. Recall the notations introduced in Lemma [2. 61 and note that Ta is a stopping 
time with respect to the filtration J^* . Since N{t) is non-increasing with limit 1 and since Z(t) is 
non-decreasing and non-negative integer valued, Ta is finite if and only if at least one particle leaves 
0. This will eventually happen, so Ta is a.s. finite. Denote by P„ the law 

Pn(-) =lPn(-|a), 

of all processes, conditioned on the entire evolution of N. 

Consider the martingale St stopped at time Ta- By Doob's inequality, we find that for any S > 

sn^\Ss\>5 N{s)ds\< ^ fAl 



52 {^ifN{s)ds)' 
52 J-'^ N{s)ds 



< [^^ ^ ^ A 1. (27) 



The last inequality follows from the first bound of (jl3p . which implies that ]E„(J(^" M{u)du) < 
Jq°' N{u)du. Define the event 



As = \l-a-5< <l + 6 

Until time Ta we have M{t) > (1 - a)N{t), and so ^ and ^ imply 

We fix a and S such that 1 — a — 6 > 1/2. After taking the expectation, we obtain, using (p2]) : 

P"(^i) < 0(n^^) + n"^2-^ = 0(n-^) 

Turning to A2, note that 

A2 C {aiV(r) < n'T} 

We claim that 

P"(aA^(r) < n^) < Cn~^. (28) 

To see this, we use the following rough estimate. Note that by (|24p . there is a probability at least 
1 — Cn~^°' that N{s) € [n"' + l,2n'^] for some s. In this case, the process will wait an amount 
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of time greater than an exponential Y with rate A2n7 before the next jump. It follows that (since 
7 < /3/2 and a < 2), 

P"(iV(r) < n^) < (1 - Cn-^°)P(y < r) 

< 1 - C7n-T" - exp(-cT?i"^) 

< 1 - Cn"^" - exp(-cn-^) 

< cn"'''. 

This completes the proof of Lemma 14. 3i □ 

We are now ready to start proving the lower-bound of Theorem 11.31 Let > be a fixed time. 

Lemma 4.4. Fix constants ao,ai such that 1 < aQ < ai. Consider the coalescent started with n 
particles, all located at u ^ G: X^{Q) = n6u{v). Let r = an~^ for some a G [ao,ai], and define the 
event A by 

A = {Xu{t) > ny{4du)} n n,^u{X,{r) > ny{4du)}. 

There are constants c,C depending on aQ,ai,du only such that ¥{A^) < Cn~^. 

Proof. The fact P(X„(r) < n^'/^^du)) < Cn~'^ is a direct consequence of (|28p where we choose 
a < 1 < Adu satisfying 1 — a — 5 > 1/2. For v ^ u, Lemma 14.31 gives a bound on the probability 
that not many particles leave the origin. It is highly probable that a proportion close to 1/du of 
these particles jumps to v. It remains to estimate the number of particles that move to v and 
subsequently coalesce. 

If all the particles that migrate to v do so immediately at time 0, so that they have strictly 
more opportunities to coalesce, the number of particles remaining at v at time r would still be 
sufficiently large. Indeed, it would then take Y amount of time, where Y is an exponential random 
variable with parameter A„7/(4d), before the first coalescence. Since ^ cmf^ for all tti ^ 1, we 
deduce that E(y) > cn~''"'. However, since 7 < /?/2 and a < 2, we have r = an~^ <^ cn~''"', hence 
P(y < r) < cn""^-/^. 

In addition, note that by Lemma 14.11 the total jump rate of n'^ particles is smaller than the 
total coalescence rate (since a > 1), so the probability any of the particles that jump to v makes 
an extra jump before time r is smaller than cn°'''~^. It follows that there are at least n'^'/{4du) 
particles located at v at time r, with probability greater than 1 — Cn~^. □ 

Proof of Theorem \1.3\ (lower bound). Let /fc(n) = fof.. .0 f(n) {k iterations) where f{n) = n'^ /Ad. 
Define the sequence of times (r^)^^ 

Tfc = Tk-1 +afk-i{n)~^. 
It is easy to check that if we take k = k{n) = log log n/(— 2 log 7), then 

fk{n) > cexp(A/logn) 

Let A' be the event that at each site within radius k there are at least /fc(n) particles at time 
Tfc. On A' ^ reasoning as in Lemma 13.4^ (at each site of this ball at least one particle remains with 
positive probability until time to), we see that N^{t) > Vol B{o, k) > c Vol i?(o, clog log n) for some 
c > 0. Thus to obtain the lower bound of Theorem 11.31 it suffices to compute the cumulative error 
probability in the iterated application of Lemma 14.41 However, it is easy to check that 

k 

P(A'") <^C\o\B{o,i)fi{n)"^ < CkYolB{o,k)fk{ny^. 

i=l 

Since Voli3(o, k) < A^, where A is the degree of the graph, this converges to as n — >• 00. □ 
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4.2 Upper bound in Theorem Theorem 11.31 



The proof of the upper-bound in Theorem Theorem ll.3l requires a few additional estimates. 

Consider the spatial coalescent on any graph G. Given some subset A d V the vertices, 
denote by Qt the number of particles that are present in A throughout the time interval [0,t]. 

Lemma 4.5. There are constants c, C > which depend on A only, so that 

lP(Qto > Ct-'/("-'Vl) < exp(-cl^l). 

Proof. Ignoring the particles after they exit A, one may assume that any particle leaving A is 
immediately killed. The main reason for Qt^ being small is the coalescence. The total rate of 
coalescence at a site v holding particles is Ax„ ~ c(X^)". At each such event at least one 
particle disappears, and therefore the total rate of decrease of Qt at time t < to is at least 

due to Jensen's inequality, since a > 1. (This is similar to |3H Theorem 12], but the above inequality 
is stronger). Thus [Qt^t < ^o) is stochastically dominated by a pure death chain where the rate of 
decrease from z to i — 1 is c|A|^~"i'^. 

One concludes the argument using Lemma 12.11 Let Ek be independent exponential random 
variables with mean = c\A\°'~^k~'^ , and define Sk = '}2k>K ^k- Then we have 

nQto >K)< F{Sk > to). 

To apply Lemma l2.1l to Sk we need to estimate ¥.Sk and Var Sk- note that for suitable constants, 
as K ^ oo, 

ESk = 5^ ^fc' ~ cil^r-ii^i-" (29) 

k>K 

and 

k>K 

In particular j^^^^^ is asymptotically constant and we may apply Lemma |2. II with some constant 
£. Thus for some C3 > 0, 

¥{Sk > 2ESk) < exp l-c^^^^^^ 



Var Sk 



Now, if K is such that ESk < to/2 we may conclude that 

From ( 1291) we see that K = CtQ ^^1^1 works for C lar ge enough. □ 

Lemma 4.6. Fix constants ao, ai,£ > 0. Consider the coalescent started with n particles, all located 
at u ^ G: ^^^,(0) = n6u{v). Let r = an~^ for some a G [ao, ai], and define the event A by 

A = []{X,{t) < CiQ4, 

V 
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with 

Q 



^2-a+7 if\v-u\<r:=\A/{a-l)], 
otherwise. 



Then there are constants C,Ci depending only on A, ao,ai such that P{A^) < Cn . 

Proof. With sufficiently high probabihty at most n^^"^'^ particles leave the origin by time r (due 
to Lemma 12.61 and (j22p ). This implies the bound for < \v — u\ <r. 

Some of the at most n'^"^"'"'^ particles leaving u may coalesce before time r, but this may only 
reduce further the number of particles. We claim that except on an event of polynomially small 
probability, none of these particles makes more than r jumps by time r. Indeed, the probability 
that by time r, a given particle has jumped more than r times is smaller than [pan~^Y and there 
can never be more than n particles in total. Thus if r is such that n}~^^ < n~'^, the probability of 
any particle reaching distance r is indeed smaller than Cn~^ , implying the statement of the lemma 
for any v such that |f — ii| > r. For the case v = u we invoke Lemma 14.51 with an arbitrary set 
A B u of size clogn. If c is large enough then, except on an event of probability bounded by , 
we have Qr < Cn^^'^\A\ <C n^/^. However, X„(r) < + Z^-, and so by Lemma 12.61 and (j22p again, 
Xuir) < n^/"^. It is easy to see from that for all a G (1, 2) we have 2-a + 7<3/4.) □ 

Proof of Theorem \1.3[ - upper bound. Note that for any a we have 7 < /3/2 < 1/2. Let c = 
max(7,3/4), and note that c < 1. Let C2 = Ci x YolB{o,r), where Ci and r are the constants in 
Lemma 14.61 

Let f{n) = C2n^^^, and as before set /fc(n) = /o • • • o f(n) {k iterations). Also set n = r = n~^, 
and 

Tk = Tk-1 + a/fc-i(n)"^. 
Let Ai be the event that at time Ti there are no particles outside B{o,ir) intersected with 

n {Xv{ri)<Mn)}. 

V, |?;|<jr 

Choose k = k{n) to be the maximal k so that /fc(n) > logn. It is clear that < (logn)^. It 

is also straightforward to check that k ~ clog logn, and that = o(l). 
Applying Lemma 14.61 iteratively, we see that 

F{Al) < TCfi{ny^\o\{B{o,ir)) < C Vol(i?(o, A;r))/fc(n)"^ > 0. (31) 

n— >oo 

i<k 

Consequently, at time Tk the total number of remaining particles is at most Cfk{n)\ol{B{o,kr)), 
and these particles are all located in B{o, kr), with high probability. 

Consider now the set B' = i?(o,M log log re) for some large M to be specified soon. In order for 
any particle to exit B' by time t it must survive to time Tk and jump at least M log log n — kr times 
by time t. Thus the expected number of particles that exit B' by time t is at most 

C/fc(n)Vol(5(o,A;r))e-^(^'^'°s^°er^-fcr-) ^ C'(logn)2(loglogn)'^e-(^*^-^')^°siogn^ 

Fix M large enough that the last expression tends to as n — )• 00. 

Finally note that if no particle leaves B' then Ylv-^vit) = Qt- By Lemma 14.51 with high 
probability the number of particles that remain in B' throughout [0, t] is at most 0((log log n)*^). □ 
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5 Global divergence of spatial A-coalescents 



5.1 Infinite tree length for A-coalescents 

Fix an arbitrary probability measure A on [0, 1]. Consider the corresponding mean- field A-coalescent 
that starts from a configuration consisting of infinitely many blocks, and let > 0) be the 

number of blocks process of its restriction to the first n particles. Define: 

Xn{t)=Xn= [ - (32) 

Jo 

The notation might be suggestive of the Kingman coalescent, so we wish to point out that the 
measure A in the following calculation is quite general. 

We are interested in the quantity Xn due to the following observation: if K"" is a good approx- 
imation for the number of blocks at the origin of the spatial A-coalescent at small times s, then for 
t small, pXn approximates well the number of particles that emigrate from the origin up to time 
t (see, for instance. Lemma l2.6p . The key ingredient in the proof of Theorem 11.21 is the following 
result. 

Lemma 5.1. For any fixed t > we have Xn ?• oo almost surely. 

n— >oo 

Proof. Denote by ~* the equivalence relation on the labels generated by the coalescent blocks at 
time t. For n > 2 let 

Tn '■= min{t > : 3j < n s.t. n ~* j} 

be the first time that the particle labelled n coalesces with any of the particles with smaller labels. 
We have that 

K"(.) = i^"-H«) + i{.<.„}, 

and therefore 

Xn = Xn-i -I- (r„ A t), 

i.e. the contribution to Xn of particle n is t„ A t. 

Define Tn to be the cr-algebra generated by {Ks}j<n,s>o- Conditioned on Tn-i, the infinitesimal 
rate of coalescence of particle n with particles with smaller labels at time s is given by 

[ ^ -x- (1 - (1 -x)^""'W)dA(x). 

^[0,1] X 

Applying (1 — x)^ > 1 — kx (for x G [0, 1]) we find that the rate of coalescence of particle n is at 
most K^~^{s) (with equality if and only if A is the point mass at 0, in which case the coalescent is 
Kingman's coalescent). Thus 

E(r„ At|j;_i) = / F{Tn > s\Tn^i) ds 
Jo 

> exp(^- i^"-i(n)(iu^ ds 

> exp {-s - (K"-i(u) - 1) dv^ ds 

= e-^"-^ [ e-'ds = {I- e-^)e-^"-\ 
Jo 
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Note that X„ is increasing and consider the martingale 

n 

Mn = Xn-Y,^i^kM\Fk-l). 
k=2 

On the event that Xn is bounded, the last calculation implies that E(rfc A t\J^k-i) is bounded from 
below, hence M„ — t- — oo. Since M is a martingale, the last event has probability 0. □ 

Note that a different proof of Lemma [5.11 follows from Corollary 3 in [3], although the arguments 
there are significantly more involved. 

5.2 Proof of Theorem Q 

We now consider the spatial coalescent corresponding to some fixed A as in the previous section, on 
an arbitrary locally finite graph G. As usual, let n denote the initial size of the population, with 
all particles initially located at o, a fixed vertex of G. Recall the definitions of the processes M and 
Z in Lemma 12.51 Both processes M and Z depend implicitly on n, omitted from the notation. We 
consider the usual coupling of coalescents that correspond to different n. 

Lemma 5.2. For any t > we have that Z{t) — > oo almost surely as n ^ oo. 

Proof. We follow the argument of Lemma 14. 3| except that we are only interested in showing that Z 
diverges, which simplifies the argument. Since Z{t) is non-decreasing in n it suffices to show that 
for any fixed m we have ¥{Z{t) < m) > 0. 

n— >oo 

Recall the martingales (jlSp . On the event {Z{t) < m}, we have for all s < t that Mg > 
N(s) — Zs > N{s) — m, and therefore St < m + pmt — pN{t)dt. Due to Lemma l5.1|, for any fixed 
m,t and any sufficiently large n, on the event {Z{t) < m} (this event also depends on n) 

St<-\l%N{t). 

As in (j27p . Doob's maximal inequality yields that for large enough n 

F{Zt < m\g) < P f sup|5,| > ^ / pNsds 

V s<t 2 Jo 

By Lemma l5. II the right-hand side tends to almost surely, so ¥{Z{t) < m) > 0. □ 

Fix e > and a vertex v of the graph, and let Em,e,v = {supjgjo^g] X^{t) > m} be the event that 
at some time t < e there are at least m particles located at site v. 

Lemma 5.3. We have F"-{Emev) > 1- 

Proof. Note that the claim is trivially true ii v = o. We prove it first for v a neighbor of o. Take 

to = r/min{e, A;;\ 

where rj is an arbitrarily small number. Now, choose no = no(to) large enough that > 2dm with 
probability at least 1 — rj, where d = deg(o). By the weak law of large numbers, one can choose 
n-o = no{tQ,h) large enough that on the event {Ztg > 2dm}, v receives at least m particles from o 
with probability at least 1 — rj. We concentrate on this event of high probability, and on these m 
particles, ignoring any further particles that might visit v. 



< 
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Jumps from v occur at rate p per particle, so the probability that any of the m particles above 
leave v before time to is at most rj. Since a coalescent event involving any fc-tuple of particles occurs 
at total rate (increasing in k), and since at a given time s < to there are up to m of the above 
particles located at v, the probability of a coalescent event before time to in which two or more 
of the m above particles participate is at most i]. It follows that ¥{X^(t) > m) > 1 — 4r/, for all 
n> uq. Since r] can be made arbitrarily small, this proves our claim for v a neighbor of o. 

For other v we use induction in the distance \v\ to o. Indeed, such v has a neighbor u satisfying 
\u\ < \v\. For any fixed m',r], and n sufficiently large, we have P"'(£'m',e,?i) > 1 — Given this, and 
using the strong Markov property, one can repeat the previous argument with m' sufficiently large 
to conclude that with probability at least 1 — 2rj there will be at least m particles at v (arriving 
from u) at some time t < 2e. □ 

Proof of Theorem \1.'A Again, due to monotonicity in n and t, it suffices to show that for any m < oo 
and any t > 0, we have lim„_)>oo ^{N^ > m) = 1. 

Let r/, e > be small numbers. Fix m < oo, and choose a subgraph C G of size m such that 
the distance between any two vertices of G™ is larger than 1/r/. By Lemma 15.31 we have that 

F"(^i,,,,) > 1, yv G G"". 

n— >oo 

Moreover, if Ay^^ is the event that the first (if any) particle that enters v before time e stays at v 
up to time e (while it may possibly coalesce with other particles), note that ¥{Ay^^\Ei^£^y) > e~'"^. 
By choosing e sufficiently small we arrive to 

lim P(n„eG- A,e) >l-r]. 

n— >oo 

However, given nt,gG'™^v,e) the probability that any pair of the above particles (located at mutual 
distance greater than l/i] at time e) will coalesce before time t tends to as — )• 0. □ 

6 Lower bound for the long time asymptotics 

We now turn to the large time asymptotic behavior of spatial coalescents. The underlying measure 
A does not play an important role here as it did for the behavior at constant times. The reason 
for this is that, as explained in the introduction (Section II. 2p . at the beginning of this phase, say 
at constant time t > 0, the number of particles at each site is tight with respect to n. When the 
number of particles at a site is small, the coalescents corresponding to different choices of A behave 
similarly. In fact, the density of particles quickly decays, and once it is small enough, it rarely 
happens that more than two particles are at the same location. With at most two particles at each 
site, any spatial A-coalescent is equivalent to spatial Kingman's coalescent. 

An important quantity in this setting is the radius m of the region (ball) which is initially "filled" 
with particles. As we have seen, for Kingman's coalescent the radius of this ball is m = log* n, while 
in the case of Beta-coalescents it is approximately m = log log n up to constants. In the general 
case, the radius m should be a certain function of both n and A, namely m = f*{n) where /* 
is defined in ([4]). This was rigorously established only for Kingman's coalescent and those with 
"regular variation" (i.e., satisfying (|20p). However, the results which we present in this section and 
the next one, are valid for essentially arbitrary coalescence mechanisms (subject to (15ip for the 
upper bound in Section [7]), and assume that the spatial A-coalescent starts with a possibly random 
but tight number of particles per site in a large ball of radius m. See Theorem 16.41 for the full 
statement. Note that in this result as in the rest of the paper, we will be taking limits as m tends 
to oo, recalling that m is itself a function of n when applying these results to get Theorem II. 4[ 
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Let us first present some further lieuristic arguments for the lower bound in Theorem II. 4[ 
Consider for the moment the case d > 2, so that the random walk migration process is transient. 
The first heuristic comes from the first moment calculation and simple Green function estimates: 
label the particles in an arbitrary way and let Si be the total number of particles that ever coalesce 
with the ith particle. Observing that a typical particle is at initial distance of order k away from 
an order k'^~^ particles, where k ranges from 1 up to a number of order m, gives for a typical i 

m 
k=l 

where we use the fact that the probability that two particles ever coalesce is proportional to the 
probability that their corresponding walks intersect (visit the same site at the same time) (the 
constant comes from the delayed coalescence dynamics). The fact that this probability is approxi- 
mately k'^~'^ is a well-known Green function estimate. Since = X^^^x ^/^i gives the total number 
of clusters that survive forever (with M being the initial number of particles, of order m'^), and 
since E(l/S'i) > l/E(S'i) we arrive at 

E(iV) > cm'^-^ (33) 

While Jensen's inequality may seem crude, this does give the correct exponents because the distri- 
bution of Si is sufficiently concentrated. The next section contains results confirming this heuristic. 

6.1 Technical random walk lemmas 

We begin with technical results concerning random walks. Most of these are standard yet difhcult 
to "pinpoint" in the random walk literature. Let {Sn,n > 0) be simple symmetric random walk 
on Z^, started from a point Xq € B{o,2m) which will later be chosen in a certain random fashion 
(very roughly speaking, close to uniform) and recall that m — t- oo. Let {Xt,t > 0) be a continuous 
time random walk on obtained as Xt := Sj\j* where {Nl,t > 0) is a Poisson process with rate 
1, independent of X. Let S' be a lazy version of S, with S'q = Sq and step distribution given by 
^{S'n_^_l — S'j^ = 0) = 1/2 and F{S'^_^i — S'^ = ±ej) = 1/8, where ei,e2 are the coordinate vectors, 
then {S'j^, ,t>0) has the same law as X. We write for the corresponding probability measures 
when Xq = x. 

Define := inf{n > : = x}, ■= inf{s > : Xg = x} to be the hitting times of x. 
Similarly, let r'^ := inf{n > 1 : S!^ = x} denote the positive hitting time of x. We abbreviate 
r' = Tq, T = To etc. 

The next result is a variation of an Erdos- Taylor formula [18] (see also jl6j . p. 354). We assume 
that as 7n — )• oo, 



where for any y £ Z we abbreviate ||y||+ := ||y|| V 1. 

Lemma 6.1. Assume d = 2 and fix s > 4:, assume a random \\Xq\\ < 2m satisfies (j34p . Then 

P(r < W) X > (35) 

log 771 + log S 

where the constants implicit in the x notation depend only on those implicit in ([Ml) (and not on s 
or m ). 



28 



It is easy to check that Xq drawn from a uniform on B{o,2m) or from a difference of two 
independent uniforms on B{o, m) will satisfy the hypotheses of Lemma l6.1l and therefore (j35p with 
universal constants (not depending on m) for any fixed s > 4. Also note that if P(Xo ^ -B(o, 2m)) = 
1, under no further restriction on the distribution of Xq, the upper bound on the probabilities 
P(r < sm?) holds with the same constant as in Lemma l6. 11 Indeed, in order for r to happen, the 
walk needs first to enter B{o, 2m) at a location close to its boundary, for which the argument below 
gives the required estimate. 

Proof. We estimate the above probability for any given x S B{o,2m), and then integrate over 
the law of Xq. Without loss of generality, assume that sm^ is an integer. Use the "last-exit 
decomposition" : 

sm^ — l sm^ — l 

l{r'<.m2} = Yl hsi=0} n hs',m 
k=l j=k+l 

together with the Markov property, to obtain 

sm^ — l 

P^(t' < sm^) = ¥^{S'k = 0)Po(t'+ > sm^ -k-1). 

k=l 

We now apply a local central limit theorem and an estimate on the distribution of excursion length 
[36] . statement El on p. 167, (or [291 Prop. 4.2.4]) and [36] statement PIO on p. 79 (or [29l Theo- 
rem 2.1.1]). We find that for some universal sequence > 



-1 



[T < sm ) = } -e k - — -. 5 ; — + O 



k=\ 



srn? — \ ^ 1. /I 

= y 1^-^ l + e,^2_fc+i ^ ^ / 1 
^—^ k log(sm2 — /c + 1) vlkll + 

Split this sum in three: For k < use < to get a total contribution of 0(l/log(sm^)). 
For k > sm^ — V sm^ each term is at most C /k so the total contribution is 0{1/ {y/s-m)). Finally, for 
the intermediate /c's each term is x 1/k log sm^, so the total contribution is x log(sm^/||a;|p)/ log(sm^). 
Thus 

^ ^ log^+21^^ 1 




log(sm2 — + 1) 1 ^ A;||x||^ log(sm^ — + 1 



log(sm2) 



uniformly over x € i?(o, sm/2). Taking expectation with respect to Xq while using ()34p and s > 2, 
yields P(r' < sm^) x ^^^"l^^) as m — )■ oo. 

Going back to the continuous time random walk, we have P(|A^|j — 2t\ > et) < e~^^'^^^,t > 
for some c(e) > 0, accounting for an additional error of 0(e~'^('')*"* ) = o(l/log(,sm)) in the 
corresponding estimate for r. □ 

We will also need later a simpler result which goes along the same lines. 

Lemma 6.2. Assume d = 2 and Xq = x is such that \\x\\ = m. For all ci > 0, there exists C2 > 
which depends only on ci such that Px(to < ci?n^) > C2/ login. 
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Proof. First we note that by easy large deviations on Poisson random variables, it suffices to prove 
the same inequality with tq replaced by the discrete time Tq. By the strong Markov property, note 
that if K{t) counts the number of hits of by time t, then for all c > 0, 

E^{K{cm'^)) < P^.(r^ < cm'^)Eo{K {cm^)) . (36) 

By the local central limit theorem, 

cm^ cm?' ^ 

Eo(i^(cm2)) = 5^Po(5^ = 0) ^ E ~ 2c log m. (37) 

fc=0 fc=l 



Also, 

cm?' cm? _ 



c'||x||V(2fc) -c'mV(2fc) 



E,.(i^(cm2)) = J]Po(Sl. = x)x J]^^ ^ > —k (38) 

fc=0 fc=l k=cm^l2 



Combining (j36|) - p8|) . we complete the proof. □ 

If d > 3, we denote by Gx the Green function of a d-dimensional walk X. It is well-known (see 
e.g. [36]) that 

Gxip^) ~ as ||x|| — )■ oo, (39) 

for some constant c that depends on d (here denotes the Euclidean norm in 

Let X(-),y(-) be independent continuous random walks in Z'^ d > 2 with jump rate 1 and 
with starting points uniform in B{o,m). Denote by at{X} := a{X{s),0 < s < t} the natural 
filtration of X and let := aoo{X} = a{X{s),0 < s < oo}. Define the stopping time 

T := inf{t : X(t) = Y{t)}. Define the collision events by 

Hs = HsiX, Y) := {r <s}, H = H{X, Y) := {r < oo}. 

Lemma 6.3. Let X, Y be independent continuous time random walks in 'L'^ starting at uniform 
points at B{o,m). For any d > 2 we have 

F{H) X m^-'^, (40) 
Var {F{H\a{X})) < Cm'^^'^-'^\ (41) 

while if d = 2, for any t > A we have 

nHtm-) ^ ^ , (42) 

^ log m + log t 

Yav{F{H,^2\a{X}))<c(- ^"^^ ) , (43) 

\ log m + log t j 

where C and the constants in x relation depend only on d. 

Proof. Assume first that d > 3. Note that the difference X{t) — Y{t) is also a continuous time 
simple random walk (with a doubled rate of jumps), and abbreviate Gx-y = G. It is well-known 
and easy to check that 
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Since x i— t- is integrable near as a function on M'^, then ()39p implies that 

— ^ V G{x-y)<Cm^-'', for any rr G Z"^. (45) 

Vol Bio, m) ^-^ 

If X G B{o, m), then a corresponding lower bound holds since a positive fraction of points in B{o, m) 
is at distance order m from x. Hence, for any x E B(o,m), 

— Yl G{x-y)^m^-\ 

\o\B[o,m) ^ 

^ ' ' y&B{o,m) 

where the constants implicit in x depend only on d. Due to (j44p . averaging over x € B{o,m) gives 
that ¥{H) X m^~'^ as claimed. (It is not hard to show similarly that ¥{H) ~ crin?~'^ for some c.) 

In order to show (j4ip . introduce a third random walk Y' independent from, and identically 
distributed as, X and Y . In analogy to H define H' = {3t, X[t) = y'(t)}. Given (t{X}, the events 

H' are independent and have the same probability. Thus 

Yav¥{H\a{X}) < K[F{H\a{X}f] 

= E[¥{H\a{X})F{H'\a{X})] 

= E[P(3t, s : X(t) = Y{t), X{s) = Y'(s)\a{X})] 

< 2F{3t,s: t < s,X{t) = Y{t),X(s) = Y'{s)), 

where for the last inequality we use the symmetry between Y and Y' . Denote by Jv the standard 
cj-field generated by processes X and Y up to time r. On the event {r < 00} , due to the strong 
Markov property and 



F{3s > T : X{s) = Y'{s) \ Fr) < cE[G{X{t) - Y'{t)) \ JV] . 

Let Z = X{t) — {Y'{t) — y(0)). Noting that ^'(0) is independent from both Jv and Z, we have 
E{G{Z - Y'{0))\J='r, Z) < Cm'^-'^, almost surely, and therefore 



E{G{X{t) -Y\T))\Tr) =E[E{G{Z -Y\0))\Tr,Z)\Tr] <Cm^-'^. 

In view of the discussion above this yields a uniform bound on Var¥{H\a{X}). 

If c? = 2, we proceed similarly, with H replaced by Hi^2. In particular, Lemma l6.ll gives the 
asymptotics of ¥{Hf^2). For the conditional variance estimate, one obtains as above 

Yar¥iHt^2iX,Y)\a{X}) < 2P[l|,<i}P(F,^2(X", y")l-^r)], 

where X" ,Y" are independent random walks started from X{t) and Y'{t), respectively, and other- 
wise independent of Jv. The result follows as before, since by Lemma WA] P(ifi^2 (X", y")| Jv) x 

log m+log t ' 

6.2 Proof of the lower bound 

We return to the spatial coalescent. Let A be an arbitrary finite measure on (0,1). Consider 
a spatial coalescent with initial configuration X{0) that stochastically dominates i.i.d. Bernoulli 
random variables with mean p > in B{o, m) (we make no assumptions on the initial configuration 
outside of B{o,m)). With a slight abuse of notation, we write N"^{t) in this section for the total 
number of particles at time t, and we define = N"^ = limj_j>oo N''^{t) be the number of particles 
that survive to time 00. 
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Theorem 6.4. Consider the spatial coalescent with initial state dominating Bernoulli variables in 
B{o,m). If d>2, then there exist a constant a > such that 



P(iV > > 1. 

If d = 2, then there exists a constant a > such that, for any t > 4, 

N-itm^)>a^-^) .1. 

Note that, since the total number of particles is non-increasing, the lower bound in the d = 2 
case holds for any t > 1 with modified constant a (or with log(2 + 1) in place of log t for any positive 
t). 

We begin with a lemma stating a similar result for a simpler initial condition and with an 
"instantaneous" coalescent mechanism, where two particles coalesce as soon as they visit the same 
site. This model is called coalescing random walks (CRW). Afterwards we couple the two models 
to obtain Theorem [67 



Lemma 6.5. Consider a system of s coalescing random walks, such that their initial positions are 
i.i.d. uniform points in B{o,m), where 



s = s{a) 



am'^-'^, d>3, 
ologm, d = 2. 



Let Z{t) denote the total number of particles at time t and let Z = limt_j>oo Z{t). If d > 2, then for 
some a > we have F{Z > am'^~^/4) > 1. 

m— >oo 

If d = 2, then for some a and all t > A, we have P [zitrn^) > a '"^JVlt'?^ * ) > 1- 

Proof. We use the following explicit construction of the CRW model with the given initial condition: 
Let {Xi{t),t > 0),i = 0, 1, . . . , s — 1 be a family of i.i.d. (non-coalescing) random walks, such that 
for each i, Xi{0) is uniform in B{o,m). At time 0, each block contains a single particle that is 
assigned a unique label in {0, 1, . . . , s — 1}. While present in the system, the particle (or block of 
particles) carrying label i follows the trajectory of Xi. If the trajectories of blocks labeled i and j 
ever intersect, they instantaneously merge into a new block that inherits the smaller label i Aj. 

Consider first the case d > 2. For each pair i,j let Aij := {\/u > : Xi[u) ^ Xj{u)} = 
H[Xi,XjY. Then on Ai^j the blocks carrying labels i and j cannot merge as a consequence of a 
single coalescence event, but might merge due to a collection of coalescence events involving lower 
indexed particles. However, on the event 

Ak := fl Ak,i, (46) 

i<k 

the block carrying label k stays in the system indefinitely. 

Consider the filtration = cr|Xj(-),i < fc}. Define = P(Afc|J-fc_i), and note that po = 1. 
The random variables {pk}, are a non-increasing sequence of random variables. To see this we use 
the fact that the random walks are independent and so 

Pfc < F Ak^i\J='k_i^ = P Ak-i^i\J='k_2^ =Pk-i, almost surely. 



32 



Next, define events 

Bk = AkU{pk < 1/2}, 

and note that 

fl, Pfe<l/2, 



Pk, Pk > 1/2. 



Consider the martingale 

k 

Mfc = ^1b^ -P(S,|J-,_i). 

i=0 

Note that Mk has increments with variance bounded (crudely) by 1. Thus VarAf^ < s (here s is 
the initial total number of blocks) and, by Markov's inequality, 

, , , s 16 
PdM.I > s/4) < ^ = -. 

However, F^B^lJ-k-i) ^ 1/2, so by the definition of M, we find 



\i<s / 



-> 0. (47) 



Since pk is non-increasing and since on the event {pk > 1/2} the events Ak and Bk coincide, we 
realize that on the event {ps > 1/2} 

i<s i<s 

Thus if we prove that 

F{ps < 1/2) y 0, (48) 

then (j47p would imply the lemma. To this end we show that ps is bounded below by a random 
quantity that is concentrated above 1/2, via second moment estimates. Specifically, from the 
definition (1461) we have 



i<s Ks 



where the last identity is due to independence of cr{Xi} for different i's. Moreover, {IP(A^ JfjjXj}), i 
0, . . . , s — 1} is an i.i.d. family of random variables. Using (j40p . 



E [ F{Ali\a{Xi}) ] <s- Cm'^-'^ < Cc 

\i<s ) 



We choose a = 1/(4C) so that this expectation is at most 1/4. Due to (HU, 

,2(2-d) ^ Q 



Var ( ¥{Al,\a{Xi}) j < s • Cm^ 

\i<s / 



SO the sum is concentrated near its mean, and (1481) follows. 



In the case d = 2, the proof is almost identical. We take s = alogm and a < l/(4Clogt), 
where C is the constant that appears in (j42p . The event Aij is accordingly redefined as Aij := 
Hf.,-y^2{Xi, Xj)'^. Otherwise, the argument proceeds exactly as above, with (jl2]) . ([13|) used in place 
of (HDD, dUD. □ 
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Proof of Theorem \6.4\ The idea is to couple the spatial coalescent X with a system of coalescing 
random walks, denoted X^, with an initial state of s particles at i.i.d. sites, uniform in B(o,m). 
We first argue that it is possible to couple the initial states so that w.h.p. X^{0) < Xy{0) (at 
every vertex). Indeed, in X^, there are A^*(0) < s occupied sites (since there may be repetitions) 
and given S, these sites are uniformly sampled from the ball B{0,m) without replacement. On 
the other hand, X{0) dominates a Bernoulli configuration on B{o,m), hence X{0) has at least 
Bm{^B{o,m),p) particles sampled without replacement. Since ¥{Bm{^B{o,m),p) > s) — )• 1, this 
holds. 

The second step of the proof is that if the initial configurations satisfy ^^(0) < ^1,(0) for all v, 
then there is a coupling of the processes so that 

X„{t)>X^{t), t>0,veV. (49) 

To see this, observe that by the consistency property of spatial A-coalescent it suffices to prove the 
result assuming that -^^(0) = ^^^(0) for all v £ V. In this case, (jl9|) follows easily by induction on 
the number of particles: Just apply the consistency property of spatial A-coalescents, after the first 
time that two particles occupy the same site. (This idea is further exploited in Lemma |7.2[ ) 

Finally, Theorem 16.41 follows by Lemma 16.51 □ 

6.3 Concentration of the number of particles 

The main result of this section is a concentration result for the number of particles alive at a certain 
time. This provides a soft alternate route for the lower-bound on the long-time behavior of the 
spatial coalescent, as we briefiy explain. 

Theorem 6.6. Fix t > 0, and consider a spatial Kingman coalescent started from some arbitrary 
configuration containing a finite number of particles. Then we have 

Var(iV(t)) < EiV(t). 

Proof. The tool used here again is a comparison to the coalescing random walk model, where par- 
ticles coalesce immediately upon meeting. We denote by {X'^{t),t > 0) a system of instantaneously 
coalescing random walks started from a certain set of vertices A in a graph G = (V, E), to be chosen 
suitably later, and let A^^(t) denote the total number of particles at time t. The proof is based on 
Arratia's correlation inequality [21 Lemma 1], which states that 

Ext{t)x;{t) < Ex:(t) • Ex;it). (50) 

Thus at any time, any two sites are negatively correlated. This inequality holds not just for the 
process on Z'^, but on any edge weighted graph. 

We now remark that the spatial Kingman coalescent on Z"' can be approximated by a system 
of instantly coalescing random walks on a larger graph. For any integer N such that N > n (the 
initial number of particles), consider the graph Gn = {y,E) with vertices V = Z,'^ x {1,...,A^}. 
The edges of Gj\f are of two types. If x ~ y in Z*^ then there is an edge between {x,i) and (y, j) 
with weight p/N. Additionally, there is an edge with weight 1/2 between {x,i) and {x,j) for any 
X, i,j. Call the set x x {1, . . . , N} a cluster. Clusters correspond to vertices of Z"^ in a natural way. 
The Z*^ coordinate of a continuous time random walk on Gat is a continuous time random walk on 
Z*^ with jump rate p. However, two walks may be present in the same cluster and not meet. It is 
clear that as long as two random walks are in the same cluster they will meet at rate one (since 
each may jump into the vertex occupied by the other). 
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The probability of two random walks meeting when one jumps from one cluster to another is 
of order Thus as long as the number of particles is negligible compared to N, the projection 

onto Z*^ of the coalescing random walks on Gn is close to the spatial Kingman coalescent on 
Z"^. As — 7- oo, the projection of X^{t) converges to X{t) (in the sense of vague convergence, 
identifying X^ and the projection of XJ^ to point measures on Z^). More precisely, for an initial 
configuration X{0) of particles on Z'^, we define a set A C Vat by choosing for each v (arbitrarily) 
X^(0) particles from the cluster of v. Let X^{t) be the process of coalescing random walks on Gjy 
started with this configuration. Then if M^{t) denote the total number of particles of X^{t), 

E(M^(t)2) = ^ EX%/x) + Yl KX%^ti^)X%^tiy) 
< EM^(t) + (EM^(t))2. 

Thus for any we have VarM^(t) < EM^(t). By dominated convergence (since all processes have 
at most n particles) we see that 

lim EM^(t)=EA(t) lim EM^{tf = EN{tf , 

and the result follows. □ 

As a simple corollary of this result, we obtain an alternate proof of Theorem 16.41 We have 
already seen in (j33p that E(A(oo)) > cm'^^^ for some c > if d > 3 (this argument is a simple 
Green function estimate, and is easy to adapt to the case d = 2). Applying Theorem 16.61 concludes 
the proof. 

It would be also possible to derive a lower-bound on the expected number of particles in a system 
of instantaneously coalescing random walks at time tm?, starting from a set A which dominates 
i.i.d. Bernoulli random variables with mean p > 0, using technology from coalescing random walks. 
We briefly outline the steps needed to do this. First, starting from a configuration where there 
is a particle at every site of Z'^, and using a famous result of Bramson and Griffeath |14j on the 
asymptotic density of particles, we conclude that about cm*^"^ such particles are in a region of 
volume Cm'^ for some large C > to be chosen suitably. If we treat the particles that started 
outside of A as ghosts, we are then led to estimate the number of ghost particles among those 
Pqj. ^j-^jg^ Qjjg pg^j^ ^]^g (duality with the voter model (see [30]) and [321 Lemma 4], which 
gives good control on the probability that the voter model escapes a ball of radius \/t, for large t. 



7 Upper bound for the number of survivors 

Assume that A is a finite measure on [0, 1] such that for some oq > 0, we have 

An > CLon for all n > 2, (51) 

where = ^^^=2 '^b,k is the total merger rate when there are n particles. Note that most coalescents 
which come down from infinity satisfy (j5ip . in particular, if A = (5|o} (the Kingman case) then (j5ip 
holds since An = (2), and if A has the regular variation property of (|20l) . then (fSTl) holds by 
Lemma 14.11 

Our goal here is to prove the following result. 
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Theorem 7.1. Fix Cq G (0, oo) and 5 > 0, and consider the spatial A-coalescent in Z"^ satisfying 
i51\). started from a configuration of at most C^m'^ particles located in B{o, m), and no particles in 
Z*^ \ i?(o, m). There exists C = C{6, Cq), such that if d > 2 then 

P (N*{5m^) < Cm"^-^] > 1, 

while, if d = 2, 

P {N*{5n?) < Clnm) > 1. 

Note that when d > 2 this order of magnitude bound is sharp, since Theorem [63] shows A^*(oo) > 
cm'^~'^. For d = 2, due to recurrence, A^*(oo) = 1, almost surely. 

The idea behind the proof is a comparison of the spatial system to a mean field approximation. 
The actual argument is based on a somewhat technical construction so we start with a non-technical 
overview. Recall the comparison with ODE described in ([5]): if at time t the density of particles 
averaged over some ball is p{t) (typically small), then we approximate the spatial coalescent with the 
mean-field model where the coalescence rate per particle is p{t) at time t, leading to the differential 
equation 

j^pit) = -p\t)/2, t>s. 

Hence p{t)~^ = c + (t — s)/2 and therefore p{t) = ^_g_^2p(s)-^ ' ^ — ^- Provided that all the particles 
in the spatial coalescent configuration are located in the ball of radius m during the whole interval 
[s, t] (and that the above approximation is valid) then their total number is approximately Cm'^p{t). 
In turn, this approximation remains valid as long as the particles remain inside a ball centered at 
the origin with radius of order m, i.e. up to time of order m^. At times of order m^, the number of 
remaining particles is of order m'^~^. 

A key difficulty of the approach outlined above comes from the fact that some particles diffuse 
away from the densest regions relatively early in the evolution, which might enable them to survive 
longer. To account for such "runaways" , we adopt a multi-scale approach, bounding at each stage the 
number of particles that "escape". This is done in Lemma l7. 81 Lemma 17.51 provides the estimates 
on the number of non-escaping particles at each stage. 

To justify the comparison of the spatial process with the mean field process we average over 
small time intervals (cf. Lemma 17.41 below) . This is necessary since at any given time it is possible 
that no vertex contains more than a single particle, in which case the immediate rate of coalescence 
is 0. However, the system is unlikely to stay in such states long enough to hinder the approximation. 
Indeed, Lemma [73] implies that the average rate of coalescence is (up to constants) as predicted by 
the mean field heuristic. The multiplicative constants are inherent to the spatial structure, and it 
seems difficult to compute them. 



7.1 Preparatory lemmas 

Our first step is a comparison lemma between the spatial A-coalescent X and a slower spatial 
coalescent. We then consider a possibly more general spatial coalescent process {{Xy{t),t > 0)}y^v 
If the process consists initially of n particles labeled by [n] = {1, . . . ,n}, a configuration consists 
as usual of labeled partitions of [n], where the label of a block corresponds to its location on V. 
Equivalently, a configuration x = {xy)y^v may be thought of as giving the list of blocks (referred 
to as particles) present at each particular site v G V. We will also sometimes abuse notation and 
denote by Xy{t) the number of particles (i.e., blocks) present at time t and at position v. We 
assume that particles perform independent continuous-time simple random walks with jump rate p, 
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and that there exists a family of real numbers Xx^s such that for all configuration x = {xy)y^Vj 
V eV, any particular subset S of all blocks present at v € V coalesces at an instantaneous rate Xx,Si 
if the current configuration is x. Moreover, coalescence events at different sites occur independently 
of one another, and are independent of the migration. We now make the following assumption on 
the family of rates Xx,s- H v £ V and x^, contains n > 2 particles, then for every 2 < k < n, we 
have: 

A^,5<5^(^)a„,,, (52) 

where Xn,k is the coalescence rate of any particular subset of size k in a A-coalescent. The idea 
behind (j52p is that if X and X have the same number of particles at time t, then X{t + e) is 
stochastically dominated by X{t + e). 

Lemma 7.2. Consider a A-coalescent X and a coalescent process X such that i52\) holds, and 
Xv{0) < Xy{0) for all v. Then there is a coupling of the processes X and X such that Xy{t) < Xy{t) 
holds for all V £ V and t >0. 

Proof By the consistency of spatial A-coalescents, it suffices to prove the result when Xy{0) = Xy{0) 
for all V €V. We associate each particle of X with a particle of X and let them perform the same 
random walks as long as there are no coalescence events. A consequence of (j52p is that it is possible 
to couple the processes so that if Xy = X^ then the coalescence events of X dominate those of X, 
that is, any coalescence event in X occurs at the same time as an event in X involving at least as 
many particles. 

The proof now proceeds by induction on the total number of particles, which are allowed to 
be distributed arbitrarily. By the above remark, we may couple the processes X and X so that 
the domination holds up to and including the first time to of a coalescence event, which involves 
particles from X and possibly from X. Assume that X also experiences a coagulation event at this 
time. (Else, we can artificially retain particles in X that were supposed to coagulate at time to- By 
the consistency property, this may only increase the process X stochastically.) 

We now use the induction hypotheses to construct processes {X'{t),t > to) and {X'{t),t > to) 
with initial configuration X'{to) = X'{to) = X{to) such that X'^{t) < X'^{t) for ah t > to. We 
can define X{t) = X'{t) for t > to, and by consistency of the spatial A-coalescents, we extend the 
coupling to X for t > to so that Xu{t) < X'^^^t) for all u £V, which proves the claim. □ 

Remark. This lemma holds for more general spatial coalescents: e.g., the instantaneous coalescence 
rates Xx,s could be allowed to be arbitrary path-dependent (i.e., J^t-measurable at time t), almost 
surely nonnegative and finite random variables. The only crucial assumption is that (j52p holds 
uniformly. 

We now apply Lemma 17.21 to the situation which is particularly useful in our setting. Recall 
that we are considering a spatial A-coalescent for which (I5ip holds. Assume that initially there are 

particles, and let {Xy{t),t > 0}v£v denote the number of particles of this process as a function 
of time and space. 

Let vr be a partition of {1, ... , N}. We refer to the blocks of vr as classes. Let {Xy{t),t > 0}v£V 
denote a process where classes evolve independently of one another, and particles within each class 
evolve according to a spatial (A)-coalescent, where A will be specified soon. That is, particles move 
as continuous-time simple random walks with rate p and coalesce when they are on the same site 
and from the same class according to a A-coalescent. 

Lemma 7.3. Assume that the blocks of tt are all of size 1 or 2, and that A = (ao/A2)A, where ao is 
the constant of h51\) and X2 = A2,2 is the pairwise coalescence rate. Assume also that Xy(0) < Xy{0) 
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for all V. Then there is a coupling of the processes X and X such that X^{t) < X^{t) holds for all 
V £V and t>0. 

Proof. Observe first that our process X is of tlie type described above Lemma [7.21 so tliat it suffices 
to establisfi (I52p . Note however that if a configuration x contains n particles at site v, and S is 
a subset of particles with \S\ = k and 2 < k < n, we have \x,s = for k > 3, while if A: = 2, 
^x,s = when the particles of S are not of the same class, and if they are of the same class, 
^x,s = {0-0/^2)^2 = oo- Since there are at most n subsets of particles that are allowed to coalesce, 
we have 



^x,s < nap < An = f ^ j 

S:|S'|>2 k=2 ^ ^ 

which proves ()52p . and completes the proof. □ 

Lemma 7.4. Fix co,Co, and consider a spatial A-coalescent satisfying Ii51\) with A^*(0) particles 
all inside B{o,R). Let p{t) = be the inverse density, and assume p{0) £ (coi^"^Co). Denote 

T = p(0)~^/'^. Then for d > 2 we have 

P(p(r)-i < p(0)"i + cir) < exp (-ci?('^-2)Vdj ^ 

where c\ depends only on d,co,Co,ao- 
If d = 2 we have 

^)<e.p(--S|-). 

Proof. We first argue that for some C = C(d), it is possible to find at least A^*(0)/4 disjoint pairs in 
the set of initial particles, so that for each pair the initial distance between its particles is at most 
Cp(0)~^/'^ (for large p(0), the particles forming such a pair are initially located at the same site). 
To achieve this, cover B{o,R) with iV*(0)/2 (disjoint) boxes of diameter CRN*{fdy^l'^ = Cp-^/"^ 
(this is possible for some C). Within each box match as many pairs as possible in an arbitrary 
manner. This leaves at most one unpaired particle in each ball, so at least A^*(0)/2 are matched, 
with all distances bounded as claimed. Refer to two particles forming a pair as "partners" . 

Consider the coupling from Lemma 17.31 where vr corresponds to the partitioned formed by 
identifying particles with their partners (which therefore contains only singletons or doubletons). 
Let Z' be the total number of coalescence events in the process H' where coalescence events involving 
members of different classes are not allowed and occur at rate qq when they are. Lemma [7.31 implies 
that Z' :< Z, in the sense of stochastic domination. Hence, it suffices to prove the claimed bounds 
for Z'. The advantage of considering H' instead of H is that different pairs of partners evolve 
independently. 

From this point on, the arguments for the cases d = 2 and d > 2 differ. In dimensions d > 2, by 
our assumptions, r > cq for some cq. The probability that random walkers started at distance at 
most p~^/'^ meet before time r/2 = /3~^/'^/2 is at least cp'-'^"^^/'^. On this event, there is probability 
bounded from that they coalesce before time r. Thus the number of partners that coalesce by 
time r dominates a Bin(A^*(0)/4, cp^'^~'^^^'^) random variable. This random variable has expectation 
cN* {fd) p^'^^'^^ / > cR^'^"'^^ 1'^ . The bound in the lemma is the probability that this random variable 
is less than half its expectation. 

Finally, if the number of coalesce events is at least cN* {fS)p{fSf''^~'^^l'^ = cN*{0)p{0)t then 
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In the case d = 2, each pair coalesces with probabihty at least c/logr (along the same lines) by 
Lemma [6. 2 [ As above, the number of coalesce events is at least cR'^ /rlogr except with probability 
g-cR /rlogr _ ^]^-g gvent, a similar computation gives 

PT^ > Po^ + V—. □ 
^ ^ rlogr 

Lemma 7.5. Let St denote the number of particles (in the spatial coalescent) that remain in B{o, R) 
during the whole interval [0,t]. In particular, Sq = ^^^^sioR) ^f(O)- -^^2; Cq > 0, and assume that 
5o = n < CqR"^ and that 2<t <R^. Then for some C depending only on d, 



>j^< cR^ exp ( - cii('^-2)Vrf) ifd>2, 



C 



and 



St Clost\ , I , 

^ > ) < (logt)'^ tfd 



Proof. As usual, the case d > 2 is considered first. The previous lemma can be formulated as follows: 
The process {p^^} is unlikely to spend more than u~'^^'^ units of time in the interval [u,u + ciu~'^^'^]. 
Note that St can only decrease faster than N*{t), so this will also hold for the modified density 
Pit) = f . 

We apply this to the following sequence of intervals. Let uq = p{0) and u^+i = + ciu^ 
Let K be minimal with uk > t/ci. As long as uj < t/ci the increment is at least ct~^/"'. It follows 

that K < Ct^^'^/'^ < R^. If the process does not spend more than u^^'^ time in [iifc,Mfc+i] then 
the time before exceeds i/ci is at most t. The probability that this fails to hold is at most 
i22exp(-rf-2)Vd). 

This works provided X > 1, or equivalently t > p(0)~^/'^. If t < p{0)~^^'^ then we have 

st<so = pm'<^<^- 

In the case d = 2, we instead have Uk+i = + It is not hard to see that 



for some c depending on uq. To this end, note that u is increasing and hence is dominated by the 
solution of the ODE /' = ci// log / (at least once u is large enough that u/ log u is increasing). This 
ODE is solved by / = ^V^nx+x^ giving the upper bound on u. For the other direction, note that once 
Ufc is large Uk+i/uk is close to 1. This implies that u dominates a solution of /' = (ci — e)//log/. 

Lemma tells us that is unlikely to spend more than units of time in [uk,Uk+i], and 
the probability of this unlikely event is at most 

Let K be such that uk > at/logt, with a small to be determined soon. Note that now the 
failure probability for the last intervals is of order 1, so a union bound does not work. However, 
Lemma 17.41 tells us more. If the process p^ fails to exceed Uk+i in the next Uk units of time, then 
by the Markov property and Lemma [7.41 again, it gets a fresh chance to do so in the next Uk units of 
time. Therefore, the number of attempts is smaller than a geometric random variable with success 
probability 1 — pk- It follows that the total time spent in [uo,uk] is stochastically dominated 
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by Q := ^^^^ Geom(l — Pk), where the geometric random variables are independent. The 
probabihty we wish to bound is therefore at most P((5 > t). By making a small we can guarantee 
Pk < 1/2 for all k, so that the geometric variables are typically small. 
More precisely, from (|53p it follows that K x log^ t and therefore that 



K 

e^^dt X VKuk X at, 



k<K 



hence for small enough a we have < 2^^^^^ Uk < t/2. Similarly, we can compute 



VarQ <C^ul< 



logt 

k<K ^ 

The lemma now follows from Chebyshev's inequality and choosing a small enough that Aa'^C < 
1. □ 

The following is a fairly standard fact which follows easily from the optional stopping theorem 
and Doob's inequality: 

Lemma 7.6. If Xt is a continuous time random walk on Z"^ then for all x £ Z'^, 



sup 

s<t 



where c, C depend only on d. 

Lemma 7.7. If W = Bin(n,p), and A > 0, then 

F{W > 2np + A) < A^^/"^ . (54) 

Proof. If A > (np)'^, then Markov's inequality gives ¥{W > A) < up/ A < A^-*^/^. If A > (np)'^, one 

can use Chebyshev's inequality to obtain ¥(W > 2np + A) < ¥(W — KW > np) < {1 — p)/ (np) < 
A-V2. □ 



7.2 Completing the proof 

Lemma 17.51 is almost sufficient to deduce Theorem 17.11 The missing piece is to account for the 
particles that "escape" from the ball under observation. We accomplish this by partitioning the 
time interval [0, m^] into several segments and applying Lemma [7.5l to each segment. More precisely, 
let K = K{m) = [log log mj, and consider the process at a particular sequence of times given by 



] k = 0, 

.k-K^2 k=l,...,K. 



Thus ti X 7v?/lnm, and the sequence increases geometrically up to tx = "i^- At each time t^, we 
will consider the behavior of the process with respect to the ball B{o,Rk), where the radii Rk are 
defined by: 

fo k = 0, 



Rk 



7(m + ^tk{K + l - k)) k>0, 
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where 7 > 1 is some constant to be determined during the proof of Lemma 17.81 Note that is 
increasing in fc, and that Rk = 2jm. 

With the above notations in mind, let (resp. Yj.) be the number of particles inside (resp. 
outside) B{o,Rk) at time tk- Let Am = Amia, P,^) be the event 

Am = \ Xk < and Yk < ^ for all A: < K ^ when d> 2. 

[ tk tk I 



and 



Am = iXk<^^i^ and y,<^Mi^ forallfc<4 when d = 2. 
tk tk J 



Lemma 7.8. Assume the initial conditions of Theorem 7.1. Then for some choice of a,j3,^ we 
have ^{Am) > 1. 

■m— >oo 

Proof. The idea is to inductively bound Xk+i^^k+i in terms of Xk,Yk. The bound on XkJ^i is 
mostly an application of Lemma [7.51 However, to bound Xk+i we need to also account for particles 
that are outside the ball -B(o, Rk) at time tk, or particles that exit the ball -B(o, Rk+i) at some time 
before time tk+i and re-enter it. These quantities can be bounded in terms of Yk and Xk as well as 
auxiliary quantities introduced soon. A delicate point in the proof is that the number of steps of 
the induction is not fixed (Inlnm), so we make sure that constants do not grow with m. Thus all 
constants below depend only on d. 

At time tQ our assumptions are that Yq = and Xq < Cqtu'^. For the induction step we define 
two additional quantities: Sk and Zk- Let Sk be the number of particles that remain in B{o,Rk) 
throughout the time interval [tk-i,tk]- We wish to apply Lemma 17.51 to 5"^. The conditions are 
clearly satisfied (recall Sk < Cqtu'^ < CoRf, since 7 > 1). Since tk — tk-i > ctk this will imply that 
with high probability 

Sk<Ci^ for all k<K. (55) 
tk 

(The probability of failure at each of log log m steps is exponentially small.) 

Let Zk be the number of particles located inside B{o, Rk) at time tk that exit B{o, Rk+i) before 
time tk+i. Lemma 17.61 bounds the escape probability for each of Xk particles inside B{o,Rk)- 
Coalescence can only reduce the number of escaping particles, so given Xk, 

Zk ^ Bin (xk,C2 exp ( -cj^'-'' " ^'^^ 



tk+i — tk 



Here 02, C2 depend only on d, and denotes stochastic domination. 
If = this implies 

Since ir? jtx x Inm, by making 7 large enough we obtain EZq = o(l). Then in particular Zq = 
with probability tending to 1. For k > 1 we use Lemma 17.71 with A = K"^ to find 



P {Zk > + 2EZk}^ < K/^ = l/K. 



Thus with probability at least 1 — K ^ — )• 1 (as ?n — )• 00), we have 

Zk<K^ + 2C2Xk exp ("-02 ^^^+'"^^^' ^ for ah k. 

V tk+i - tk J 
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Using tk+i = etk, k > 1, together with y/tk+i{K 
with high probabihty, Zq = and 



^e/2y^tk{K + 1 — k), we conclude that 



Zk<K^ + 2C2Xke 



for ah k< K. 



(56) 



With these preparations in place, we are ready for the induction. Assume from here on that 
Zq = and ([55]) . ([56]) hold. We have, for each k, the deterministic bounds 

Xk < Sk + Yk_i + Zk_i, 
Yk<Yk^i + Zk-i. 

To see this, note that particles in B{o,Rk) either stayed inside (5), started outside {¥), or exited 
and returned (Z). The bound on Y is similar. 

We now carry out an induction over k to bound Xk, Yk for all k = 1, . . . , K . Suppose that the 
bound (from the event A^) on Xj and Yj hold for all j < k. It follows from (j56p and the inductive 
hypothesis that 

j<k i<i<fc 



< kK^ + 2C2a^ Yl -e-''^"^''-^^ 

tk . tj 

j<k 



We require 7 to be large enough that 037^ > 2. Then the last sum is at most l/(e — 1) < 1 and so 

Yk < kK^ + 2C2a^e-=37^(i^-fc). (57) 

tk 

This proves the induction step for Yk with any choice of /5 > 2C2OL (since kK^ < < Ri/tk), on 
the event from (j56p . for all sufficiently large m. 

It remains to bound Xk, for which we will use the bounds on S^, Yk-i and Zk-i- We already 
have ^ 

Yk-i <{k- l)K^ + 2C2a;^e-'=37^(J^+i-fe) < _ i);^4 ^ 2eC2a:^e-=3^'. 

Using the induction hypothesis and ()56p (with /c — 1 replacing A;) one finds 

Zk-i <K^ + 2C2a^e-^37^(^+i-'=) < i^^ + 2eC2a^e-'-^^\ 
tk-i tk 



Thus we have 



Xk <Sk + n-i + Zk-i <Ci^ + kK^ + AeC2a^e-^-'^' 

tk tk 



kK" +[Ci + AeC2ae-^-'^' ] ^ . (58) 

tk 
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To finish the proof it remains to select a and 7 (and (3 > 2C2a) so that 

Ci + 4eC2ae-^^T' < a. 

2 

This is done by requiring 7 to satisfy 6*^37 > 4eC2 and taking any sufficiently large a. 

Turning to the case d = 2, we proceed along the same lines. Lemma 17.51 gives with probability 
with high probability 

< Ci ^k log"^ k<K. (59) 

The failure probability is log^^m at each of log log m steps. Note that logtj ~ logm for all i, so 
we are not giving much away here. Furthermore, w.h.p. Zq = and (j56p holds (the proof of these 
facts does not depend on d. 

We now repeat the induction. Given ([56]) and the induction hypothesis bounds on X we get (as 
above, with an extra logm factor) 

n < kK^ + 2C2a^^i^e-37^(i^-^). 

Since the bounds for Sk, ifc-i and Z^-i differ from the general case only by a logm on the Rl/tk 
term, the bound for gets the same factor as well. □ 
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